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I. Introduction 

The point of these notes, and the lectures from which they came, 
is not to provide a rigorous and complete introduction to microlocal 
analysis — many good ones now exist — but rather to give a quick and 
impressionistic feel for how the subject is used in practice. In particular, 
the philosophy is to crudely axiomatize the machinery of pseudodiffer- 
ential and Fourier integral operators, and then to see what problems 
this enables us to solve. The primary emphasis is on application of com- 
mutator methods to yield microlocal energy estimates, and on simple 
parametrix constructions in the framework of the calculus of Fourier 
integral operators; the rigorous justification of the computations is kept 
as much as possible inside a black box. By contrast, the author has 
found that lecture courses focusing on a careful development of the 
inner workings of this black box can (at least when he is the lecturer) 
too easily bog down in technicality, leaving the students with no notion 
of why one might suffer through such agonies. The ideal education, of 
course, includes both approaches. . . 

A wide range of more comprehensive and careful treatments of this 
subject are now available. Among those that the reader might want to 
consult for supplementary reading are [17], [7], [22], [24], [26], [2], [28], 
[16] (with the last three focusing on the "semi-classical" point of view, 
which is not covered here). Hormander's treatise [11], [12], [13], [14] 
remains the definitive reference on many aspects of the subject. 

Some familiarity with the theory of distributions (or a willingness 
to pick it up) is a prerequisite for reading these notes, and fine treat- 
ments of this material include [11] and [6]. (Additionally, an appendix 
sets out the notation and most basic concepts in Fourier analysis and 
distribution theory.) 

Much of the hard technical work in what follows has been shifted 
onto the reader, in the form of exercises. Doing at least some of them 
is essential to following the exposition. The exercises that are marked 
with a "star" are in general harder or longer than those without, in 
some cases requiring ideas not developed here. 

The author has many debts to acknowledge in the preparation of 
these notes. The students at the CMI/ETH summer school were the 
ideal audience, and provided helpful suggestions on the exposition, as 
well as turning up numerous errors and inconsistencies in the notes (al- 
though many more surely remain). Discussions with Andrew Hassell, 
Michael Taylor, Andras Vasy, and Maciej Zworski were very valuable 
in the preparation of these lectures and notes. Rohan Kadakia kindly 
corrected a number of errrors in the final version of the manuscript. 
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Finally, the author wishes to gratefully acknowledge Richard Melrose, 
who taught him most of what he knows of this subject: a strong influ- 
ence of Melrose's own excellent lecture notes [17] can surely be detected 
here. 

The author would like to thank the Clay Mathematics Institute and 
ETH for their sponsorship of the summer school, and MSRI for its 
hospitality in Fall 2008, while the notes were being revised. The author 
also acknowledges partial support from NSF grant DMS-0700318. 

2. PREQUEL: energy methods and commutators 

This section is supposed to be like the part of an action movie before 
the opening credits: a few explosions and a car chase to get you in the 
right frame of mind, to be followed by a more careful exposition of plot. 

2.1. The Schrodinger equation on R n . Let us consider a solution 
ip to the Schrodinger equation on I x R" : 

(2.1) %- x d t ^ - VV = 0. 

The complex- valued "wavefunction" ip is supposed to describe the time- 
evolution of a free quantum particle (in rather unphysical units). We'll 
use the notation A = —V 2 (note the sign: it makes the operator posi- 
tive, but is a bit non-standard). 

Consider, for any self-adjoint operator A, the quantity 

where (•, •) is the sesquilinear L 2 -inner product on W l . In the usual 
interpretation of QM, this is the expectation value of the "observable" 
A. Since d t ip = iV 2 ip = —iAip, we can easily find the time-evolution of 
the expectation of A : 

dt(AM) = (dt(A)M) + {A(-iA)M) + W, HA)^>. 

Now, using the self-adjointness of A and the sesquilinearity, we may 
rewrite this as 

(2.2) d t (Aip, V>> = (d t (A)i>, V>> + i{[A, A]i/>, ^ 

where [S, T] denotes the commutator ST — TS of two operators (and 
d t (A) represents the derivative of the operator itself, which may have 
time-dependence). Note that this computation is a bit bogus in that 
it's a formal manipulation that we've done without regard to whether 
the quantities involved make sense, or whether the formal integration 
by parts (i.e. the use of the self-adjointness of A) was justified. For 
now, let's just keep in mind that this makes sense for sufficiently "nice" 
solutions, and postpone the technicalities. 
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If you want to learn things about ip(t,x), you might try to use (2.2) 
with a judicious choice of A. For instance, setting A = Id shows that the 
L 2 -norm of ip(t, •) is conserved. Additionally, choosing A = A k shows 
that the H k norm is conserved (see the appendix for a definition of this 
norm). In both these examples, we are using the fact that [A, A] = 0. 

A more interesting example might be the following: set A = d r , the 
radial derivative. We may write the Laplace operator on M, n in polar 
coordinates as 

a = -a? - Zzla, + * 

where Ag is the Laplacian on S*™ -1 ; thus we compute 

[AA]= 2 £-(^. 

Exercise 2.1. Do this computation! (Be aware that d r is not a differ- 
ential operator with smooth coefficients.) 

This is kind of a funny looking operator. Note that A is self- adjoint, 
and d r wants to be anti-self-adjoint, but isn't quite. In fact, it makes 
more sense to replace d r by 

77 — 1 

A = (i/2)(d r -d;) = d r + — , 

which corrects d r by a lower-order term to be anti-self-adjoint. 
Exercise 2.2. Show that 



r 

Trying again, we get by dint of a little work: 

(2.3) [A.^l^ + ^f^. 

provided n, the dimension, is at least 4. 

Exercise 2.3. Derive (2.3), where you should think of both sides as 
operators from Schwartz functions to tempered distributions (see the 
appendix for definitions). What happens if n — 3? If n — 2? Be 
very careful about differentiating negative powers of r in the context 
of distribution theory. . . 

Why do we like (2.3)? Well, it has the very lovely feature that both 
summands on the RHS are positive operators. Let's plug this into (2.2) 
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and integrate on a finite time interval: 

= [ T 2\\r- 1 / 2 f^\\ 2 dt+ (n ~ 1)(n ~ 3) ||r- 3 / 2 ^|| 2 ^ 
Jo 2 

where y represents the (correctly scaled) angular gradient: y = r _1 V$, 
where Vg denotes the gradient on S n ~ L . 

Now, we're going to turn the way we use this estimate on its head, 
relative to what we did with conservation of L 2 and H k norms: the 
Ze/£-hand-side can be estimated by a constant times the H 1 ! 2 norm of 
the initial data. This should be at least plausible for the derivative 
term, since morally, half a derivative can be dumped on each copy of 
u, but is complicated by the fact that d r is not a differential operator on 
]R n with smooth coefficients. The following (somewhat lengthy) pair of 
exercises goes somewhat far afield from the main thrust of these notes, 
but is necessary to justify our 

#1/2 

estimate. 

In the sequel, we employ the useful notation / < g to indicate that 
/ < Cg for some C G when / and g are Banach norms of some 
function, C is always supposed to be independent of the function. 

Exercise* 2.4. 

(1) Verify that for u G S{R n ) with n > 3, \(d r u,u)\ < \\uf H1/2 . 
Hint: Use the fact that 

d r = \x\~ 1 xid : 



X3 ■ 



Check that x/\x\ is a bounded multiplier on both L 2 and H 1 , 
and hence, by interpolation and duality, on H^ 1 / 2 . An efficient 
treatment of the interpolation methods you will need can be 
found in [25]. You will probably also need to use Hardy's in- 
equality (see Exercise 2.5). 
(2) Likewise, show that the {r~ 1 u, u) term is bounded by a multiple 
of ||u||^-i/2 (again, use Exercise 2.5). 

Exercise 2.5. Prove Hardy's inequality: if u G i/ 1 (IR n ) with n > 3, then 



(n -2f -• " 



Vu\ dx. 



4 

Hint: In polar coordinates, we have for u G <S(R n ) 
" dr [ I \u\ 2 r n -' 6 drdO. 



Integrate by parts in the r integral, and apply Cauchy-Schwarz. 
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So we obtain, finally, the Morawetz inequality: if i[)q G i/ 1 / 2 (lR n ), 
with n > 4 then 

(2 ' 4) T 

2 jT llr-V'^H 2 dt + {n ~ ^ ~ 3) £ \\r-^\\ 2 dt < II^IIU- 

Now remember that we've been working rather formally, and there's 
no guarantee that either of the terms on the LHS is finite a priori. But 
the RHS is finite, so since both terms on the LHS are positive, both 
must be finite, provided G if 1 / 2 . (This is a dangerously sloppy way 
of reasoning — see the exercises below.) So one stroke two 

nice pieces of information: if tp G H 1 ^ 2 , we obtain the finiteness of 
both terms on the left. 

Let's try and understand these. The term 

f \\r-^\\ 2 dt 
Jo 

gives us a weighted estimate, which we can write as 

(2.5) ^er 3/2 L 2 ([0,T];L 2 (l n )) 
for any T, or, more briefly, as 

(2.6) V G r^L\ oc L\ 

(The right side of (2.5) denotes the Hilbert space of functions that are 
of the form r 3//2 times an element of the space of L 2 functions on [0, T] 
with values in the Hilbert space L 2 (W a ); note that whenever we use 
the condensed notation (2.6), the Hilbert space for the time variables 
will precede that for the spatial variables.) So ip can't "bunch up" 
too much at the origin. Incidentally, our whole setup was translation 
invariant, so in fact we can conclude 

^e\x-x f 2 Ll c L 2 

for any Xq G M n , and ip can't bunch up too much anywhere at all. 

How about the other term? One interesting thing we can do is the 
following: Choose xq, x\ in R", and let X be a smooth vector field with 
support disjoint from the line x$X\. Then we may write X in the form 

X = X + Xi 

with Xj smooth, and Xj _L (x — Xi) for % — 0, 1; in other words, we split 
X into angular vector fields with respect to the origin of coordinates 
placed at x and x 1 respectively. Moreover, we can arrange that the co- 
efficients of Xj be bounded in terms of the coefficients of X (provided we 
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bound the support uniformly away from xqXi). Thus, we can estimate 
for any such vector field X and any u G C£°(R n ) 



IX-ul 2 dx < 



2 r 2 

\x — xo| _1 ^ 2 y o M dx + J \x — xil -1 ^ 2 ^ 7 ^ 



dx 



where ^ is the angular gradient with respect to the origin of coordi- 
nates at Xi. Since for a solution of the Schrodinger equation, (2.4) tells 
us that the time integral of each of these latter terms is bounded by the 
squared if 1//2 norm of the initial data, we can assemble these estimates 
with the choices X = xd x3 for any \ £ C^°(lR n ) to obtain 

WxVn 2 dt<\\M\mn- 



o 

In more compact notation, we have shown that 

This is called the local smoothing estimate. It says that on average in 
time, the solution is locally half a derivative smoother than the initial 
data was; one consequence is that in fact, with initial data in if 1 / 2 , the 
solution is in H l in space at almost every time. 

Exercise 2.6. Work out the Morawetz estimate in dimension 3. (This 
is in many ways the nicest case.) Note that our techniques yield no 
estimate in dimension 2, however. 

In fact, if all we care about is the local smoothing estimate (and this 
is frequently the case) there is an easier commutator argument that 
we can employ to get just that estimate. Let f(r) be a function on 
R + that equals for r < 1, is increasing, and equals 1 for r > 2. Set 
A = f(r)d r and employ (2.2) just as we did before. The commutant 
f(r)d r (as opposed to just d r ) has the virtue of actually being a smooth 
vector field on M™. So we can write 

[A, f(r)d r ] = -2 f'(r)«9 2 + 2r" 3 /(r)A e + R 

where R is a first order operator with coefficients in C^°(lR n ). As we 
didn't bother to make our commutant anti-self-adjoint, we might like 
to fix things up now by rewriting 

[A, f(r)d r ] = -2d*J'(r)d r + 2r~ 3 f (r)A, + R > 

where R' is of the same type as R. Note that both main terms on the 
right are now nonnegative operators, and also that the term containing 
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d* is not, appearances to the contrary, singular at the origin, owing to 
the vanishing of /' there. Thus we obtain, by another use of (2.2), 



(2.7) 



dt 



dt 



< 



[ r \(R'^^)\dt+\(f(r)d r ^, 
Jo 



Now the first term on the RHS is bounded by a multiple of HV'olljji/a ( as 
R' is first order with coefficients in C£°(IR n )); the second term is likewise 
(since / is bounded with compactly supported derivative, and zero near 
the origin). This gives us an estimate of the desired form, valid on any 
compact subset of supp / fl supp /', which can be translated to contain 
any point. 

Exercise 2.7. This exercise is on giving some rigorous underpinnings 
to some of the formal estimates above. It also gets you thinking about 
the alternative, Fourier-theoretic, picture of how might think about 
solutions to the Schrodinger equation. 1 



(1) Using the Fourier transform, 2 show that if ip G L 
(2) 



n ), there 

exists a unique solution i[)(t,x) to (2.1) with ip(0,x) = ipQ. 
As long as you're at it, use the Fourier transform to derive the 
explicit form of the solution: show that 



ip(t,x) = ip * K t 

where K t is the "Schrodinger kernel;" give an explicit formula 
for K t . 

(3) Use your explicit formula for K t to show that if ipo G L 1 then 
ip(T,x) G L°°(R n ) for any T ^ 0. 

(4) Show using the first part, i.e. by thinking about the solution 
operator as a Fourier multiplier, that if ipo G H 8 then ip(t,x) G 
L°°(R t ; H s ), hence give another proof that H s regularity is con- 
served. 

(5) Likewise, show that the Schrodinger evolution in M n takes Schwartz 
functions to Schwartz functions. 

(6) Rigorously justify the Morawetz inequality if ipo G <S(R n ). Then 
use a density argument to rigorously justify it for ^ £ -ff 1//2 (M n ). 

1 If you want to work hard, you might try to derive the local smoothing estimate 
from the explicit form of the Schrodinger kernel derived below. It's not so easy! 

2 See the appendix for a very brief review of the Fourier transform acting on 
tempered distributions and L 2 -based Sobolev spaces. 
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2.2. The Schrodinger equation with a metric. Now let us change 
our problem a bit. Say we are on an n-dimensional manifold, or even 
just on R™ endowed with a complete non-Euclidean Riemannian metric 
g. There is a canonical choice for the Laplace operator in this setting: 

A = d*d 

where d takes functions to one-forms, and the adjoint is with respect 
to L 2 inner products on both (which of course also involve the volume 
form associated to the Riemannian metric). This yields, in coordinates, 

(2.8) A = -^^V^ 

where Y17j=i ® d x j is the dual metric on forms (hence g l i is the 
inverse matrix to gij) and g denotes det(^j). 

Exercise 2.8. Check this computation! 

Exercise 2.9. Write the Euclidean metric on R 3 in spherical coordi- 
nates, and use (2.8) to compute the Laplacian in spherical coordinates. 

We can now consider the Schrodinger equation with the Euclidean 
Laplacian replaced by this new "Laplace-Beltrami" operator. By stan- 
dard results in the spectral theory of self-adjoint operators, 3 there is 
still a solution in L°°(R; L 2 ) given any L 2 initial data — this generalizes 
our Fourier transform computation in Exercise 2.7 — but its form and 
its properties are much harder to read off. 

Computing commutators with this operator is a little trickier than 
in the Euclidean case, but certainly feasible; you might certainly try 
computing [A,d r + (n — l)/(2r)] where r is the distance from some 
fixed point. 

Exercise 2.10. Write out the Laplace operator in Riemannian polar 
coordinates, and compute [A, d r + (n — l)/ (2r)] near r = 0. 

But what happens when we get beyond the injectivity radius? Of 
course, the r variable doesn't make any sense any more. Moreover, if 
we try to think of d r as the operator of differentiating "along geodesies 
emanating from the origin" then at a conjugate point to 0, we have 
the problem that we're somehow supposed to be be simultaneously 
differentiating in two different directions. One fix for this problem is 
to employ the calculus of pseudodifferential operators, which permits 
us to construct operators that behave differently depending on what 

3 The operator A is manifestly formally self-adjoint, but in fact turns out to be 
essentially self-adjoint on C^°(X) for X any complete manifold. 
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direction we're looking in: we can make operators that separate out 
the different geodesies passing through the conjugate point, and do 
different things along them. 

2.3. The wave equation. Let 

Du = (<9 2 + A)u = 

denote the wave equation on 1 x 1" (recall that A = — X^<)- For 
simplicity of notation, let us consider only real-valued solutions in this 
section. 

The usual route to thinking about the energy of a solution to the 
wave equation is as follows. We consider the integral 

(2.9) = f (Du, d t u) dt 

Jo 

where (•, •) is the inner product on L 2 (IR n ). Then integrating by parts 
in t and in x gives the conservation of 

||^|| 2 +||V M || 2 . 

We can recast this formally as a commutator argument, if we like, by 
considering the commutator with the indicator function of an interval: 

0= / ([D,l [0 , T] {t)d t }u,u)dt. 

The integral vanishes, at least formally, by self-adjointness of □ — it is 
in fact a better idea to think of this whole thing as an inner product 
on R n+1 : 

([D,l [0jT] (t)]d t u,u) Rn+1 . 

Having gone this far, we might like to replace the indicator function 
with something smooth, to give a better justification for this formal 
integration by parts; let xif) be a smooth approximator to the indicator 
function with x' — 4>i ~~ 02 with (pi and 2 nonnegative bump functions 
supported respectively in (— e, e) and (T—e, T+e), with 02( - ) = 4>i{-—T) 
Let A = x(t)dt + dtx(t)- Then we have 



[n,A]=2d tX 'd t + dh' + x'dl 



MICROLOCAL ANALYSIS 11 

and by (formal) anti-self-adjointness of d t (and the fact that u is as- 
sumed real), 

= ([D,A]u,u) Rn+1 = -2(x'd t u,d t u) Rn+1 + 2(x'u,d 2 u) Rn+1 
= -2(tfdtu, d t u) Rn+1 + 2(x'u, V 2 w) Rn+1 
= -2(x'd t u, d t u) Rn+1 - 2(x'V, Vm) p+ i 



= -2/ (j)i{t){\u t \ 2 + \Vu\ 2 )dtdx 
2 / (j)2it)(\u t \ 2 + \Vu\ 2 )dtdx. 



+ 

Thus, the energy on the time interval [T — e, T + e] (modulated by the 
cutoff 02) is the same as that in the time interval [— e, e] (modulated 
by 0i). 

We can get fancier, of course. Finite propagation speed is usually 
proved by considering the variant of (2.9) 

// Du d t u dx dt, 
-T 2 J\x\ 2 <t 2 

with < Ti < T 2 . Integrating by parts gives negative boundary terms, 
and we find that the energy in 

{t = -T l5 \x\ 2 < T 2 } 

is bounded by that in 

{t = -T 2 , \x\ 2 < T 2 }. 

Hence if the solution has zero Cauchy data (i.e. value, time-derivative) 
on the latter surface, it also has zero Cauchy data on the former. 

Exercise 2.11. Go through this argument to show finite propagation 
speed. 

Making this argument into a commutator argument is messier, but 
still possible: 

Exercise* 2.12. Write a positive commutator version of the proof of 
finite propagation speed, using smooth cutoffs instead of integrations 
by parts. (An account of energy estimates with smooth temporal cut- 
offs, in the general setting of Lorentzian manifolds, can be found in [27, 
Section 3].) 

There is of course also a Morawetz estimate for the wave equation! 
(Indeed, this was what Morawetz originally proved.) 
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Exercise* 2.13. Derive (part of) the Morawetz estimate: Let u solve 

Uu = 0, (u,d t u)\ t=0 = (f,g) 
on R™, with n > 4. Show that 

H r " 3/2 HlL? oc (M»+i) ~ WfWm + IMIk 

this is analogous to the weight part of the Morawetz estimate we derived 
for the Schrodinger equation. There is in fact no need for the local L 2 
norm — the global spacetime estimate works too: prove this estimate, 
and use it to draw a conclusion about the long-time decay of a solution 
to the wave equation with Cauchy data in C™(R n ) © C™(R n ). 
Hint: consider ([□, x(t)(d r + (n - l)/(2r))]u,u) Rn+1 . 



3. The pseudodifferential calculus 

Recall that we hoped to describe a class of operators enriching the 
differential operators that would, among other things, enable us to 
deal properly with the local smoothing estimate on manifolds, where 
conjugate points caused our commutator arguments with ordinary dif- 
ferential operators to break down. One solution to this problem turns 
out to lie in the calculus of pseudodifferential operators. 

3.1. Differential operators. What kind of a creature is a pseudodif- 
ferential operator? Well, first let's think more seriously about differ- 
ential operators. A linear differential operator of order m is something 
of the form 

(3.1) P= a a (x)D a 

\a\<m 

where Dj = i~ 1 (<9/<9a>') and we employ "multiindex notation:" 

D a = D^...D^, 

We will always take our coefficients to be smooth: 

a a GC°°(M n ). 

We let 

Diff m (M n ) 

denote the collection of all differential operators of order m on W 1 (and 
will later employ the analogous notation on a manifold). 
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If P G Diff m (IR™) is given by (3.1), we can associate with P a function 
by formally turning differentiation in x^ into a formal variable £j with 

(&,...,£„) eK": 

This is called the "total (left-) symbol" of P; of course, knowing p is 
equivalent to knowing P. Note that p(x,£) is a rather special kind of 
a function on R 2n : it is actually polynomial in the £ variables with 
smooth coefficients. Let us write p = a tot (P). 
Note that 

<7tat ■ P ^P 

is noi a ring homomorpism: we have 

P( 5 = ^Pa{x)D a q p {x)D^ 

and if we expand out this product to be of the form 

7 

then the coefficients c 7 will involve all kinds of derivatives of the qps. 
This is a pain, but on the other hand life would be pretty boring if the 
ring of differential operators were commutative. 

If we make do with less, though, composition of operators doesn't 
look so bad. We let a m (P), the principal symbol of P, just be the 
symbol of the top-order parts of P : 

a m (p) = 

\a\=m 

Note that o~ m (P) is a homogeneous degree-m polynomial in £, i.e., a 
polynomial such that a m {P){x, A£) = \ m a rn (P)(x, £) for A G KL As 
a result, we can reconstruct it from its value at |£| = 1, and it makes 
sense for many purposes to just consider it as a (rather special) smooth 
function on IR n x S' n_1 . It turns out to make more invariant sense to 
regard the principal symbol as a homogeneous polynomial on T*IR n , so 
that once we have scaled away the action of M + , we may regard it as 
a function on S*M. n , the unit cotangent bundle of M n , which is simply 
defined as T*R n /M + (or identified with the bundle of unit covectors in, 
say, the Euclidean metric). To clarify when we are talking about the 
symbol on S^IR™, we define 4 



The reader is warned that this notation is not a standard one. 
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Now it is the case that the principal symbol is a homomorphism: 

Proposition 3.1. For P,Q differential operators of order m resp. m', 

or m +m'{.PQ) = cr m (P)a m ,(Q). 

(and likewise with a). 

Exercise 3.1. Verify this! 

Moreover, the principal symbol has another lovely property that the 
total symbol lacks: it behaves well under change of variables. If y = 
(f)(x) is a change of variables, with a diffeomorphism, and if P is 
a differential operator in the x variables, we can of course define a 
pushforward of P by 

(^p)f = P(07) 

Then in particular, 

k 

hence 

MD») = Dll . . . I® = (± ^D ykl ) ...(± 9 ^D y , 

\fcl=l / \k n =l 

when we again try to write this in our usual form, as a sum of co- 
efficients times derivatives, we end up with a hideous mess involving 
high derivatives of the diffeomorphism <fi. But, if we restrict ourselves to 
dealing with principal symbols alone, the expression simplifies in both 
form and (especially) interpretation: 

Proposition 3.2. If P is a differential operator given by (3.1), and 

y = (f)(x), then 

<r m (<i>*P)(y,ri) = a M~\y)) ( ) " " " ( ) 

\a\=m \ki=l / \fc„ = l / 

where r\ are the new variables "dual" to the y variables. 

This corresponds exactly to the behavior of a function defined on 
the cotangent bundle: if is a diffeomorphism from M" to R™, then it 
induces a map $ = <p* : T*W y ->■ T*M™, and 

a m (^P) = &(<r m (P)). 

Exercise 3.2. Prove the proposition, and verify this interpretation of 
it. 
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Notwithstanding its poor properties, it is nonetheless a useful fact 
that the map 

a tot : P ^p 

is one-to-one and onto polynomials with smooth coefficients; it there- 
fore has an inverse, which we shall denote 

Opi-.p^ P, 

taking functions on T*M ra that happen to be polynomial in the fiber 
variables to differential operators on R n . Opg is called a "quantization" 
map. 5 You may wonder about the £ in the subscript: it stands for 
"left," and has to do with the fact that we chose to write differential 
operators in the form (3.1) instead of as 

P= D a a a (x), 

\a\<m 

with the coefficients on the right. This would have changed the defini- 
tion of o"tot and hence of its inverse. 

Note that Op^x^) = x^ (i.e. the operation of multiplication by x^) 
while Ope(^j) = Dj. 

Why not, you might ask, try to extend this quantization map to a 
more general class of functions on T*R"? This is indeed how we obtain 
the calculus of pseudodifferential operators. The tricky point to keep in 
mind, however, is that for most purposes, it is asking too much to deal 
with the quantizations of all possible functions on T*IR n , so we'll deal 
only with a class of functions that are somewhat akin to polynomials 
in the fiber variables. 

3.2. Quantum mechanics. One reason why you might care about the 
existence of a quantization map, and give it such a suggestive name, 
lies in the foundations of quantum mechanics. 

It is helpful to think about T*M. n as being a classical phase space, with 
the x variables (in the base) being "position" and the £ variables (the 
fiber variables) as "momenta" in the various directions. The general 
notion of classical mechanics (in its Hamiltonian formulation) is as 
follows: The state of a particle is a point in the phase space T*M n , 
and moves along some curve in T*M n as time evolves; an observable 
p(x, £) is a function on the phase space that we may evaluate at the 
state (x, £) of our particle to give a number (the observation). By 
contrast, a quantum particle is described by a complex- valued function 
ip(x) on M n , and a quantum observable is a self-adjoint operator P 
acting on functions on R™. Doing the same measurement repeatedly on 



'It is far from unique, as will become readily apparent. 
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identically prepared quantum states is not guaranteed to produce the 
same number each time, but at least we can talk about the expected 
value of the observation, and it's simply 

(Ptp,ip) L2{Rn) . 

In the early development of quantum mechanics, physicists sought a 
way to transform the classical world into the quantum world, i.e. of 
taking functions on T*M™ to operators on 6 L 2 (M ra ). This is, loosely 
speaking, the process of "quantization." 

We now turn to the question of describing the dynamics in the quan- 
tum and classical worlds. To describe how the point in phase space 
corresponding to a classical particle in Hamiltonian mechanics evolves 
in time, we use the notion of the "Poisson bracket" of two observables. 
In coordinates, we can explicitly define 

if } ! - V d/ dg df dg 
U,9i - 2^ d £. dx j dxidtj 

(this in fact makes invariant sense on any symplectic manifold). The 
map g \-¥ {f,g} defines a vector field 7 (the Hamilton vector field) 
associated to / : 

H =v^— -^-A 

f <K, ().r > <).<■> d£j 

The classical time-evolution is along the flow generated by the Hamilton 
vector field associated to the energy function of our system, i.e. the flow 
along Hh for some given h € C°°(T*lR n ). By contrast, the wavefunction 
for a quantum particle evolves in time according to the Schrodinger 
equation (2.1), with —V 2 in general replaced by a self-adjoint "Hamil- 
tonian operator" H whose principal symbol is the energy function h. 8 
By a mild generalization of (2.2), the time derivative of the expectation 
of an observable A is related to the commutator 

[H,A]. 

One of the essential features of quantum mechanics is that 

<Tm+m'([H,A]) = i{o- m (H),Cr m ,(A)}, 



6 Well, they are not necessarily going to be denned on all of L 2 ; the techni- 
cal subtleties of unbounded self-adjoint operators will mostly not concern us here, 
however. 

7 We use the geometers' convention of identifying a vector and the directional 
derivative along it. 

8 For honest physical applications, one really ought to introduce the semi-classical 
point of view here, carrying Planck's constant along as a small parameter and using 
an associated notion of principal symbol. 
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so that the time-evolution of the quantum observable A is related to 
the classical evolution of its symbol along the Hamilton flow; this is the 
"correspondence principle" between classical and quantum mechanics. 9 

3.3. Quantization. How might we construct a quantization map ex- 
tending the usual quantization on fiber-polynomials? 

Let J 7 denote the Fourier transform (see Appendix for details). Then 
we may write, on M n , 

(D x ^)(x)=T- l ^u = (2ny n J e ix % J e'^y) dy d£ 

i 3 e^H{y)dydi 
Likewise, since = I, we of course have 

(x j ^)(x) = (2n)- n [ [ xie l{x - yH 4>(y)dyd£ 



-ill 



Going a bit further, we see that at least for a fiber polynomial a(x,£) = 

Y2 a a( x )£ a we have 

(3.2) 

(Op t (a)i/>)(x) = J2 a a (x)D a ^(x) = W n JJ a{x^)e l{x - y) H{y)dydi- 

stripping away the function ip, we can also simply write the Schwartz 
kernel (see Appendix) of the operator Op^(a) as 

«(Op,(a)) = (27r)- n J a(x,Oe l{x - yH dt 

(Making sense of the integrals written above is not entirely trivial: 
Given ip e <S(lR ra ), we can make sense of the £ integral in (3.2), which 
looks (potentially) divergent, by observing that 

(1 + |£| 2 r fe (l + Ay) k e i( - x - y K = e^-^ 

for all k G N; repeatedly integrating by parts in y then moves the 
derivatives onto vp. This method brings down an arbitrary negative 
power of (1 + |£| 2 ) at the cost of differentiating ip, thus making the 
£ integral convergent. 10 Similar arguments yield continuity of Op^(a) 



9 In the semi-classical setting, the correspondence principle tells that we can in a 
sense recover CM from QM in the limit when Planck's constant tends to zero. What 
we have in this setting is a correspondence principle that works at high energies, 
i.e. in doing computations with high-frequency waves. 

10 This kind of integration by parts argument is ubiquitous in the subject, and 
somewhat scanted in these notes, relative to its true importance. 
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as a map <S(R n ) — > S(W l ), hence we can extend to let Op^(a) act on 
ip G S' by duality. For more details, cf. [17].) 

Exercise* 3.3. Verify the vague assertions in the parenthetical remark 
above. You may wish to consult, for example, the beginning of [10]. 

This of course suggests that we use (3.2) as the definition of Op^(a) 
for more general observables ("symbols") a. And we do. In M™, we set 

(3.3) (Op e (a)^)(x) = j^y L f a(x,0e iix - y) H(y) dyd£. 

We can define the pseudodifferential operators on W 1 to be just the 
range of this quantization map on some reasonable set of symbols a, to 
be discussed below. 

On a Riemannian manifold, we can make similar constructions global 
by cutting off near the diagonal and using the exponential map and its 
inverse. The pseudodifferential operators are those whose Schwartz 
kernels 11 near the diagonal look like (3.3) in local coordinates, and 
that away from the diagonal are allowed to be arbitrary functions in 
C°°(XxX). If the manifold is noncompact, we will often assume further 
that operators are properly supported, i.e. that both left- and right- 
projection give proper maps from the support of the Schwartz kernel 
to X. 

3.4. The pseudodifferential calculus. 

Definition 3.3. A function a on T*IR n is a classical symbol of order m 
if 

• ae C°°(T*R n ) 

• On [£[ > 1, we have 

a(x,0 = \£\ m a(x,£, |£| -1 ), 
where a is a smooth function on R™ x S'r 1 x R+, and 

i = 4 e S - 

We then write a G S™(T*R n ). 

It is convenient to introduce the notation 

(0 = (i + Kl 2 ) 1/2 , 

so that (£) behaves like [£[ near infinity, but is smooth and nonvanish- 
ing at 0. A fancy way of saying that a is a classical symbol of order 

11 For some remarks on the Schwartz kernel theorem, see the Appendix. 
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m is thus to simply say that a is equal to (£) m times a smooth func- 
tion on the fiberwise radial compactification of T*R n , denoted T R n . 
This compactification is defined as follows: We can diffeomorphically 
identify R^ with the interior of the unit ball by first mapping it to the 
upper hemisphere of S n C R™ +1 by mapping 

(3.4) t"(&>-®) 

and identifying this latter space with the interior of the ball. Then 
becomes a boundary defining function, i.e. one that cuts out the 
boundary nondegenerately as its zero-set; 1/|£| is also a valid bound- 
ary defining function near the boundary of the ball, i.e. away from its 
singularity. 

A very important consequence is that we can write a Taylor series 
for a near = (the "sphere at infinity") to obtain 

oo 

a(x,0 ~ ^a m _,(x,0|^r- j , with a m _, G C°°(R n x S^ 1 ), 

3=0 

and where the tilde denotes an "asymptotic expansion" — truncating 
the expansion at the |£| m-7V term gives an error that is Od^l" 1 "^ 1 ). 12 

If X is a Riemannian manifold, we may define S™(T*X) in the same 
fashion, insisting that these conditions hold in local coordinates. 13 

(For later use, we will also want symbols in a more general geometric 
setting: if E is a vector bundle we define 

S%(E) 

to consist of smooth functions having an asymptotic expansion, as 
above, in the fiber variables. Often, we will be concerned with triv- 
ial examples like E — R™ x R|, where we will usually use Greek letters 
to distinguish the fiber variables.) 

The classical symbols are the functions that we will "quantize" into 
operators using the definition (3.3). As with fiber-polynomials, the 
symbol that we quantize to make a given operator will transform in a 
complicated manner under change of variables, but the top order part 
of the symbol, a m (x, £) G C°°(S*M. n ), will transform invariantly. 



This does not, of course, mean that the series has to converge, or, if it converges, 
that it has to converge to a : we never said a had to be analytic in |£| , after all. 

13 One should of course check that the conditions for being a classical symbol are 
in fact coordinate invariant. 



20 



JARED WUNSCH 



Exercise 3.4. We say that a function a G C°°{T*X) is a Kohn-Nirenberg 
symbol of order m on T*X (and write a G «S'k N (T'*X)) if for all a, (3, 

(3.5) sup (£) l/5|_m |^a| = C a , p < oo. 

Check that S^ c (T*R n ) C S£ N (T*R n ), where the extra subscript c 
denotes compact support in the base variables. Find examples of Kohn- 
Nirenberg symbols compactly supported in x that are not classical 
symbols. 14 

In the interests of full disclosure, it should be pointed out that it 
is the Kohn-Nirenberg symbols, rather than the classical ones defined 
above, that are conventionally used in the definition of the pseudodif- 
ferential calculus. 

At this point, as discussed in the previous section, we are in a position 
to "define" the pseudodifferential calculus as sketched at the end of the 
previous section: it consists of operators whose Schwartz kernels near 
the diagonal look like the quantizations of classical symbols, and away 
from the diagonal are smooth. While our quantization procedure so 
far has been restricted to R n , the theory is in fact cleanest on compact 
manifolds, so we shall state the properties of the calculus only for X 
a compact n-manifold. 15 Most of the properties continue to hold on 
noncompact manifolds provided we are a little more careful either to 
control the behavior of the symbols at infinity, or if we restrict ourselves 
to "properly supported" operators, where the projections to each factor 
of the support of the Schwartz kernels give proper maps. We will 
therefore not shy away from pseudodifferential operators on R n , for 
instance, even though they are technically a bit distinct; indeed we 
will only use them in situations where we could in fact localize, and 
work on a large torus instead. 

Instead of trying to make a definition of the calculus and read off its 
properties, we shall simply try to axiomatize these objects: 

The space of pseudodifferential operators W*(X) on a 

COMPACT MANIFOLD X ENJOYS THE FOLLOWING PROPERTIES. (Note 

that this enumeration is followed by further commentary.) 

(I) (Algebra property) ^> m (X) is a vector space for each m G R. 
If A G ^ m (X) and B G V m '(X) then AB G ^ m+m '(X). Also, 
A* G ^/ m (X). Composition of operators is associative and dis- 
tributive. The identity operator is in ty°(X). 

14 Note that most authors use S m to denote S^ N . 

15 Some remarks about the noncompact case will be found in the explanatory 
notes that follow. 
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(II) (Characterization of smoothing operators) We let 

= p|^ m (X); 

m 

the operators in \& _00 (X) are exactly those whose Schwartz ker- 
nels are C°° functions on X x X, and can also be characterized 
by the property that they map distributions to smooth func- 
tions on X. 

(III) (Principal symbol homomorphism) There is family of linear 
"principal symbol maps" a m : ^ m (X) ->■ C°°(S*X) such that if 
A G ^ m (X) and 5 G ^ m '(X), 

(AS) = a m (A)a m i(B) 

and 

6- m (A*) = a m (A) 

We think of the principal symbol either as a function on the unit 
cosphere bundle S*X or as a homogeneous function of degree 
m on T*X, depending on the context, and we let a m (A) denote 
the latter. 

(IV) (Symbol exact sequence) There is a short exact sequence 

V m -\X) ->■ ^ m (X) ^ C°°(S*X) ->■ 0, 

hence the principal symbol of order m is if and only if an 
operator is of order m — 1. 
(V) There is a linear "quantization map" Op : S%(T*X) ->■ ^ m (X) 
such that if a ~ a m _ 3 -(x, 0^1™^ e S C 7(T*X) then 

<r m (Op(a)) = a m (x,i). 

The map Op is onto, modulo ^~°°(X). 
(VI) (Symbol of commutator) If A G ^ m (X), B G W m ' (X) then 16 
[A, B] G * m+m '- 1 (X), and we have 

a m+m >([A,B]) = i{a m (a),a m ,(b)}. 

(VII) (L 2 -boundedness, compactness) If A = Op (a) G ^°(X) then 
A : £ 2 (V) — > L 2 (X) is bounded, with a bound depending on 
finitely many constants C a ^ in (3.5). Moreoever, if A G \I/ m (X), 
then 

A G £{H s (X),H s - m (X)) for all s G R. 



'That the order is m + ml — 1 follows from Properties (III), (IV). 
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Note in particular that A maps C°°(X) — > C°°(X). As a further 
consequence, note that operators of negative order are compact 
operators on L 2 (X). 

(Asymptotic summation) Given Aj e ^ m ~^{X), with j e N, 
there exists A e \& m (X) such that 

^~£^-> 

which means that 

A 7 " 

A-£a,- e * m - JV_1 (X) 

for each A 7 ". 

(Microsupport) Let A = Op(a) + R, R e *-°°(A). The set 
of (x ,|o) e S*X such that a(x,f) = 0(1^"°°) for x,£ in 
some neighborhood of (a?o>£o) * s well-defined, independent of 
our choice of quantization map. Its complement is called the 
microsupport of A, and is denoted WF' A. We moreover have 

WF' AB C WF' A n WF' 5, WF'(A + £) C WF' A U WF' B, 

WF' A* = WF' A. 
The condition WF' A = is equivalent to A e ^°°(X). 
Commentary: 

(I) If we begin by defining our operators on lR n by the formula 
(3.3), with a G S™(T*~R n ), it is quite nontrivial to verify that 
the composition of two such operators is of the same type; like- 
wise for adjoints. Much of the work that we are omitting in 
developing the calculus goes into verifying this property. 
(II) On a non-compact manifold, it is only among, say, properly 
supported operators that elements of ty~°°(X) are characterized 
by mapping distributions to smooth functions. 
(Ill) Note that there is no sensible, invariant, way to associate, to 
an operator A, a "total symbol" a such that A = Op(a). As 
we saw before, a putative "total symbol" even for differential 
operators would be catastrophically bad under change of vari- 
ables. Moreover, as we also saw for differential operators, it's a 
little hard to see what the total symbol of the composition is. 
This principal symbol map is a compromise that turns out to be 
extremely useful, especially when coupled with the asymptotic 
summation property, in making iterative arguments. 
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(IV) A good way to think of this is that a m is just the obstruction 

to an operator in \l> m (X) being of order m — 1. 
(V) The map Op is far from unique. Even on M. n , for instance, 
we can use Op^ as defined by (3.2) but we could also use the 
"Weyl" quantization 

(Op w (a)^)(x) = W n JJ a((x + y)/2,0e l{x - yH ^(y)dydt 

or the "right" quantization 

(Op r (a)^)(x) = i^)~ n JJ a(y,Oe* {x - yH Hy)dydt 

or any of the obvious interpolating choices. On a manifold 
the choices to be made are even more striking. One conve- 
nient choice that works globally on a manifold is what might be 
called "Riemann-Weyl" quantization: Fix a Riemannian met- 
ric g. Given a G S™(T*X), define the Schwartz kernel of an 
operator A by 

K(A)(x,y) = {2ir)- n J X (x, y)a(m(x, y), 0^ cxp * ^) <% 

here x is a cutoff localizing near the diagonal and in particular, 
within the injectivity radius; m(x,y) denotes the midpoint of 
the shortest geodesic between x, y, exp denotes the exponential 
map, and the round brackets denote the pairing of vectors and 
covectors. The "Weyl" in the name refers to the evaluation of a 
at m(x, y) as opposed to x or y (which give rise to corresponding 
"left" and "right" quantizations respectively — also acceptable 
choices). The "Riemann" of course refers to our use of a choice 
of metric. 

We will often only employ a single simple consequence of the 
existence of a quantization map: given a m G C°°(S*X) and 
m G R, there exists A G ^ m (X) with principal symbol a m and 
with WF' A = supp a m . 
(VI) A priori of course AB — BA G ~ty m+m (X); however the principal 
symbol vanishes, by the commutativity of C co (S*X). Hence the 
need for a lower-order term, which is subtler, and noncommu- 
tative. That the Poisson bracket is well-defined independent of 
coordinates reflects the fact that T*X is naturally a symplec- 
tic manifold, and the Poisson bracket is well-defined on such a 
manifold (see §4.1 below). 
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Exercise 3.5. Check (by actually performing a change of coor- 
dinates) that if f,g G C°°(T*X), then {f,g} is well-defined, 
independent of coordinates. 

This property is the one which ties classical dynamics to 
quantum evolution, as the discussion in §3.2 shows. 

(VII) Remarkably, the mapping property is one that can be derived 
from the other properties of the calculus purely algebraically, 
with the only analytic input being boundedness of operators in 
ty~°°(X). This is the famous Hormander "square-root" argument- 
see [10], as well as Exercise 3.12 below. 

On noncompact manifolds, restricting our attention to prop- 
erly supported operators gives boundedness L 2 — > Lf oc . 

The compactness of negative order operators of course follows 
from boundedness, together with Rellich's lemma, but is worth 
emphasizing; we can regard <j as the "obstruction to compact- 
ness" in general. On noncompact manifolds, this compactness 
property fails quite badly, resulting in much interesting mathe- 
matics. 

(VIII) This follows from our ability to do the corresponding "asymp- 
totic summation" of total symbols, which in turn is precisely 
"Borel's Lemma," which tells us that any sequence of coef- 
ficients are the Taylor coefficients of a C°° function; here we 
are applying the result to smooth functions on the radial com- 
pactification of T*X, and the Taylor series is in the variable 
a = |£| \ at a = 0. 
(IX) Since the total symbol is not well-defined, it is not so obvious 
that the microsupport is well-defined; verifying this requires 
checking how the total symbol transforms under change of coor- 
dinates; likewise, we may verify that the (highly non-invariant) 
formula for the total symbol of the composition respects micro- 
supports to give information about WF' AB. 

3.5. Some consequences. If you believe that there exists a calculus of 
operators with the properties enumerated above, well, then you believe 
quite a lot! For instance: 

Theorem 3.4. Let P e ty m (X) with a m (P) nowhere vanishing on 
S*X. Then there exists Q e ^~ m (X) such that 

QP-I,PQ-I e V-°°(X). 

In other words, P has an approximate inverse ( "parametrix" ) which 
succeeds in inverting it modulo smoothing operators. 
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An operator P with nonvanishing principal symbol is said to be 
elliptic. Note that this theorem gives us, via the Sobolev estimates 
of (VII), the usual elliptic regularity estimates. In particular, we can 
deduce 

Pu G C°°(X) ==>- u G C°°(X). 
Exercise 3.6. Prove this. 

Proof. Let g_ m = (l/a m (P))\ let Q- m G ^~ m (X) have principal sym- 
bol q-m- (Such an operator exists by the exactness of the short exact 
symbol sequence.) Then by (III), 

0-o{PQ-m) = 1, 

hence by (IV), 17 

Now we try to correct for this "error term:" pick Q_ m _i G ^~ m ~ 1 (X) 
with 

0--m-l(Q- m -l) = -0-_l(itLi)/(7 m (P). 

Then we have 

P(Q-m + Q-m-l) ~ I = R-2 G ^ 2 (X). 

Continuing iteratively, we get a series of Qj G ^~ m ~- ? such that 

P(Q_ m + • • • + Q- m - N ) -Ie *- JV " 1 (X). 
Using (VIII), pick 

— oo 
j=-m 

This gives the desired parametrix: 

Exercise 3.7. 

(1) Check that PQ - I G 

(2) Check that QP - I e * _0O (X). (Hint: First check that a 
left parametrix exists; you may find it helpful to take adjoints. 
Then check that the left parametrix must agree with the right 
parametrix.) 

□ 



The identity operator has principal symbol equal to 1, since the symbol map 
is a homomorphism. 
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Exercise 3.8. Show that an elliptic pseudodifferential operator on a 
compact manifold is Fredholm. (Hint: You can show, for instance, 
that the kernel is finite dimensional by observing that the existence of 
a parametrix implies that the identity operator on the kernel is equal 
to a smoothing operator, which is compact.) 

Exercise* 3.9. 

(1) Let X be a compact manifold. Show that if P G fy m (X) is 
elliptic, and has an actual inverse operator as a map from 
smooth functions to smooth functions, then G ^~ m (X). 
(Hint: Show that the parametrix differs from the inverse by an 
operator in ^"^(X) — remember that an operator is in fy~°°(X) 
if and only if it maps distributions to smooth functions.) 

(2) More generally, show that if P G \E ,m (X) is elliptic, then there 
exists a generalized inverse of P, inverting P on its range, map- 
ping to the orthocomplement of the kernel, and annihilating the 
orthocomplement of the range, that lies in \P~ m (X). 

Exercise* 3.10. Let X be compact, and P an elliptic operator on X, 
as above, with positive order. Using the spectral theorem for compact, 
self-adjoint operators, show that if P* = P, then there is an orthornor- 
mal basis for L 2 (X) of eigenfunctions of P, with eigenvalues tending to 
+oo. Show that the eigenfunctions are in C°°(X). (Hint: show that 
there exists a basis of such eigenfunctions for the generalized inverse Q 
and then see what you can say about P.) 

Exercise 3.11. Let X be compact. 

(1) Show that the principal symbol of A, the Laplace-Beltrami op- 
erator on a compact Riemannian manifold, is just 

the metric induced on the cotangent bundle. 

(2) Using the previous exercise, conclude that there exists an or- 
thonormal basis for L 2 (X) of eigenfunctions of A, with eigen- 
values tending toward +oo. 

Exercise 3.12. Work out the Hormander "square root trick" on a com- 
pact manifold X as follows. 

(1) Show that if P G ty°(X) is self-adjoint, with positive principal 
symbol, then P has an approximate square root, i.e. there exists 
Q G V°(X) such that Q* = Q and P - Q 2 G V-°°(X). (Hint: 
Use an iterative construction, as in the proof of existence of 
elliptic parametrices.) 



MICROLOCAL ANALYSIS 



27 



(2) Show that operators in ^~°°(X) are L 2 -bounded. 

(3) Show that an operator A G V°(X) is L 2 -bounded. (Hint: Take 
an approximate square root of XI — A* A for A>0.) 

As usual, let A denote the Laplacian on a compact manifold. By 
Exercise 3.12, there exists an operator A G ^J 1 (X) such that A 2 = 
A + R, with R G \I/ -0O pr). By abstract methods of spectral theory, we 
know that y/~K exists as an unbounded operator on L 2 {X). (This is a 
very simple use of the functional calculus: merely take \fK to act by 
multiplication by Xj on each <f>j, where (<pj,X 2 ) are the eigenfunctions 
and eigenvalues of the Laplacian, from Exercise 3.11.) In fact, we can 
improve this argument to obtain: 

Proposition 3.5. 

v^A G ^\X). 

Indeed, it follows from a theorem of Seeley that all complex powers 
of a self-adjoint, elliptic pseudodifferential operator 18 on a compact 
manifold are pseudodifferential operators. 

All proofs of the proposition seem to introduce an auxiliary param- 
eter in some way, and the following (taken directly from [24, Chapter 
XII, §1]) seems one of the simplest. An alternative approach, using 
the theory of elliptic boundary problems, is sketched in [26, pp. 32-33, 
Exercises 4-6]. 

Proof. Let A be the self-adjoint parametrix constructed in Exercise 3.12, 
so that 

A 2 - A = R G y-°°(X). 
By taking a parametrix for the square root of A, in turn, we obtain 

A = B 2 + R! 

with B G ^ 1 / 2 (X) and R' G \& _0 °, both self-adjoint; then pairing with 
a test function shows that 

(AM) > (R'4>A) > -c\\u\\ 2 

for some C G R. Thus, A can only have finitely many nonpositive eigen- 
values (since it has a compact generalized inverse) hence its eigenvalues 
can accumulate only at +oo). So we may alter A by the smoothing 
operator projecting off of these eigenspaces, and maintain 

A 2 - A = Re^-°°(X) 

(with a different R, of course) while now ensuring that A is positive. 



18 Seeley's theorem is better yet: self-adjointness is unnecessary. 
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Now we may write, using the spectral theorem, 

(A')- 1/2 = — / z- l ' 2 {{A')-z)- l dz 
2m J r 

where T is a contour encircling the positive real axis counterclockwise, 
and given by Imz = Rez for z sufficiently large, and A' is given by 
A minus the projection onto constants (hence has no zero eigenvalue). 
(The integral converges in norm, as self-adjointness of A' yields 

Likewise, since A 2 = A' + R (with R yet another smoothing operator) 
we may write 



dz 



Hence 

(AT 1/2 " A- 1 = jf z-^ [((A') - z)^ - ((A') +R- z)^] dz 

= — ( z~ 1/2 ((A') - z)~ 1 R((A') + R — zy 1 dz. 
2m J r 

Now the integrand, ^ _1 / 2 ((A / ) — z)~ 1 R((A') + R — z)" 1 , is for each z 
a smoothing operator, and decays fast enough that when applied to 
any u G V(X), the integral converges to an element of C°°(X) (in 
particular, the integral converges in C°(X), even after application of 
A k on the left, for any k). Hence 

(A')- 1/2 -A' 1 = Ee 

thus we also obtain 

as (A') 1 / 2 differs from A 1//2 by the smoothing operator of projection 
onto constants, this shows that 

A 1/2 G V\X). □ 

4. Wavefront set 

If P G ^ m (X) and (x ,fo) G S*X, we say P is elliptic at (x ,fo) if 
& m (P)(x , £ ) 7^ 0. Of course if P is elliptic at each point in S*X, it is 
elliptic in the sense defined above. We let 

ell(P) = : P is elliptic at 

and let 

S P = S*X\en(X); 
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Sp is known as the characteristic set of P. 

Exercise 4.1. 

(1) Show that ellP C WF'P. 

(2) If P is a differential operator of order m of the form a a (x)D a 
then show that WF'P = 7r*(U supp a a ), while ellP may be 
smaller. 

The following "partition of unity" result, and variants on it, will 
frequently be useful in discussing microsupports. It yields an operator 
that is microlocally the identity on a compact set, and microsupported 
close to it. 

Lemma 4.1. Given K C U C S*X with K compact, U open, there 
exists a self-adjoint operator B e fy°(X) with 

WF'(Id -B) C\K = 0, WF' B CU. 

Exercise 4.2. Prove the lemma. (Hint: You might wish to try con- 
structing B in the form 

Op(Mot(Id)) 

where o" to t(Id) is the total symbol of the identity (which is simply 1 for 
all the usual quantizations on R") and is a cutoff function equal to 
1 on K and supported in U. Then make B self- adjoint.) 

Theorem 4.2. If P e ^ m (X) is elliptic at (xo,£o), there exists a 
microlocal elliptic parametrix Q e ^~ m (X) such that 

(x , £ ) ^ WF' (PQ - /) U WF'(QP - /). 

In other words, you should think of Q as inverting P microlocally 
near (z ,£o)- 

Exercise 4.3. Prove the theorem. (Hint: If P is a microlocal partition 
of unity as in Lemma 4.1, microsupported sufficiently close to (rr ,£o) 
and microlocally the identity in a smaller neighborhood, then show 

W = BP + A Op((0 m )(Id -B) 

is globally elliptic provided A G C is chosen appropriately. Now, using 
the existence of an elliptic parametrix for W, prove the theorem.) 

Let u be a distribution on a manifold X. We define the wavefront set 
of u as follows. 

Definition 4.3. The wavefront set of u, 

WFm C S*X, 
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is given by 

(x ,fo) i WF M 

if and only if there exists P G \I/ (X), elliptic at (x ,£o)> such that 

Pu G C°°. 

Exercise 4.4. Show that the choice of ^/°(X) in this definition is im- 
material, and that we get the same definition of WFw if we require 
P G ty m (X) instead. 

Note that the wavefront set is, from its definition, a closed set. In- 
stead of viewing WFw as a subset of S*X, we also, on occasion, think 
of WF u as a conic subset of T*X\o, with o denoting the zero section; 
a conic set in a vector bundle is just one that is invariant under the R + 
action on the fibers. 

An important variant is as follows: we say that 

(*o,£o)£WF™ U 
if and only if there exists P G ty m (X), elliptic at (x ,C, ) such that 

Pu G L 2 (X). 

Proposition 4.4. WFm = if and only if u G C°°(X); WF m n = if 
and only if u G H%.(X). 

The wavefront set serves the purpose of measuring not just where, 
but also in what (co-) direction, a distribution fails to be in C°°(X) (or 
H m in the case of the indexed version). It is instructive to think about 
testing for such regularity, at least on M n , by localizing and Fourier 
transforming. Given (x ,£o) G S*W a , let G C^°(W n ) be nonzero at x ; 
let 7 G C°°{R n ) be given by 

7(0=iKlj|| -6>|)x(KI) 

where ■?/> is a cutoff function supported near x = and x(0 G C°°(M) is 
equal to for t < 1 and 1 for t > 2. Think of 7 as a cutoff in a cone of 
directions near £o ? but modified to be smooth at the origin. (We will 
use such a construction frequently, and refer in future to a function 
such as 7 as a "conic cutoff near direction £o-"-) 

Now note that 4>(x)^f(^) is a symbol of order zero, and 

(4.1) Op,(0(x)7(O)* = Op r (^(x) 7 (0)« = (2vr)-^- 1 7(O^(0«). 

By definition, if Op e ((j)(x)^(^))*u G C°°, then (x ,£ ) £ WFm. Note 
that since (pu has compact support, we automatically have F{(f)u) G 
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C 00 ,henceJ-' 1 ^J r ((pu) is rapidly decreasing. Since J 7 is an isomorphism 
from 5(IR n ) to itself, we see that it in fact suffices to have 

7^(<M) G S(W l ) 

to be able to conclude that (xo,£o) ^ WFw. Conversely, one can check 
that the class of operators of the form 

o P ,(0(x) 7 (or 

is rich enough that this in fact amounts to a characterization of wave- 
front set: 

Proposition 4.5. We have (x ,£,o) ^ WF-u if and only if there exist 
0, 7 as above with 

7J r (<K) G S(R n ). 

Exercise 4.5. Prove the Proposition. (Hint: If A G ^°(IR n ) is elliptic 
at (x ,Co) and Au G C 00 (R"), construct B = Op e ((f)(x)-f(Z))* as above 
so that WF' B is contained in the set where A is elliptic. Hence there 
is a microlocal parametrix Q such that B(QA — I) G ^ _00 (X).) 

Note that if w is smooth near x , then we have <pu G C™(W l ) for 
appropriately chosen 0, hence there is no wavefront set in the fiber 
over xo. 

If, by contrast, u is not smooth in any neighborhood of Xq, then we 
of course do not have J-"(0m) G S, although it is in the wavefront 
set includes the directions in which it fails to be rapidly decaying. 

Thus, we can easily see that in fact the projection to the base vari- 
ables of WFu is the singular support of u, i.e. the points which have 
no neighborhood in which the distribution u is a C°° function. 

Exercise 4.6. Let Q C IR n be a domain with smooth boundary. Show 
that WF If} = SN*(dQ), the spherical normal bundle of the boundary. 
(Hint: You may want to use the fact that the definition of WFw is 
coordinate-invariant . ) 

We have a result constraining the wavefront set of a solution to a 
PDE or, more generally, a pseudodifferential equation, directly follow- 
ing from the definition: 

Theorem 4.6. If Pu G C°°(X), then WFuC £ P . 

Proof. By definition, Pu G C°° (X) means that WF u n ell P = 0. □ 

Theorem 4.7. If P e WFP« C WFmH WF'P. 

Exercise 4.7. Prove this, using microlocal elliptic parametrices for the 
inclusion in WF w. 



32 



JARED WUNSCH 



The property of pseudodifferential operators that WF Pu C WF u is 
called "microlocality:" the operators are not "local," in that they do 
move supports of distributions around, but they don't move singulari- 
ties, even in the refined sense of wavefront set. 

We shall also need related results on Sobolev based wavefront sets 
in what follows: 

Proposition 4.8. If P e ^ m (X), WF k ' m Pu C WF^nWF'P for 
all keR. 

Corollary 4.9. let P £ ^ m (X). If 

WF' P fl WF m u = 

then 

Pu e L 2 (X). 

Exercise 4.8. Prove the proposition (again using a microlocal elliptic 
parametrix) and the corollary. 

We will have occasion to use the following relationship between or- 
dinary and Sobolev-based wavefront sets: 

Proposition 4.10. 

WFm = |JWF fc M. 

k 

Exercise 4.9. Prove the proposition. 

Exercise 4.10. Let □ denote the wave operator, 

Du = D\u - Au 

on M = R x X with X a Riemannian manifold 
front set of u is a subset of the "wave cone" {r 2 
dual variable to t and £ to x in T*(M). 

Exercise 4.11. 

(1) Let k < n, and let i : M fc — > R ra denote the inclusion map. 

Show that there is a continuous restriction map on compactly 
supported distributions with no wavefront set conormal to M fc : 

i* :{ue £'{R n ) : WFm n SN*{R k ) = 0} ->■ S'(R k ). 

HINT: Show that it suffices to consider u supported in a small 
neighborhood of a single point in R k . Then take the Fourier 
transform of u and try to integrate in the conormal variables to 
obtain the Fourier transform of the restriction. 



. Show that the wave- 
= \£\ 2 } where r is the 
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(2) 



Show that, with the notation of the previous part, 
WF l*u C l*(WFu) 



where : T^ fc M n — > T*R k is the naturally defined projection 
map. 

(3) Show that both the previous parts make sense, and are valid, 
for restriction to an embedded submanifold Y of a manifold X. 

(4) Show that if u is a distribution on Rj; and v is a distribution on 
M. 1 then w = u(x)v(y) is a distribution on ~R k+l and 

WFw C [(suppw,0) x WFv] U [WF« x (suppw,0)] U WFm x WFv. 

(Hint: Localize and Fourier transform, as in (4.1).) 

You might wonder: given P, can the wavefront set of a solution to 
Pu = be any closed subset of S? The answer is no, there are, in 
general, further constraints. To talk about them effectively, we should 
digress briefly back into geometry. 

4.1. Hamilton flows. We now amplify the discussion §3.2 of Hamil- 
tonian mechanics and symplectic geometry, generalizing it to a broader 
geometric context. 

Let N be a symplectic manifold, that is to say, one endowed with a 
closed, nondegenerate 19 two-form. (Our prime example is N = T*X, 
endowed with the form ^ d£j Acta/ 7 ; by Darboux's theorem, every sym- 
plectic manifold in fact locally looks like this.) 

Given a real- valued function a e C°°(N), we can make a Hamilton 
vector field from a as follows: by nondegeneracy, there is a unique 
vector field H a such that Ly\ a u = ou(-, H a ) = da. 

Exercise 4.12. Check that in local coordinates in T*X, 



Thus, for any smooth function b, we may define the Poisson bracket 



Exercise 4.13. Check that the Poisson bracket is antisymmetric. 

It is easy to verify that the flow along H a preserves both the sym- 
plectic form and the function a : we have from Cartan's formula (and 



Nondegeneracy of u> means that contraction with u> is an isomorphism from 
T p N to T*N at each point. 




{a,b} = H a (b) 



since u is closed): 



£i-i a (w) = dt^ a u = d(da) = 0; 
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also, 

H (a) = da(H a ) = w(H a , H a ) = 0. 

The integral curves of the vector field H a are called the bicharac- 
teristics of a and those lying inside S a = {a = 0} are called nn// 
bicharacteristics. 

Exercise* 4.14. 

(1) Show that the bicharacteristics of |£| = (02(A)) 1 / 2 project to 
X to be geodesies. The flow along the Hamilton vector field of 
|£| is known as geodesic flow. 

(2) Show that the null bicharacteristics of a 2 (0) are lifts to T*(R x 
X) of geodesies of X, traversed both forward and backward at 
unit speed. 

Recall that the setting of symplectic manifolds is exactly that of 
Hamiltonian mechanics: given such a manifold, we can regard it as 
the phase space for a particle; specifying a function (the "energy" or 
"Hamiltonian" ) gives a vector field, and the flow along this vector field 
is supposed to describe the time-evolution of our particle in the phase 
space. 

Exercise 4.15. Check that the phase space evolution of the harmonic 
oscillator Hamiltonian, (1/2) (£ 2 + x 2 ) on T*R, agrees with what you 
learned in physics class long ago. 

4.2. Propagation of singularities. 

Theorem 4.11 (Hormander). Let Pu E C°°(X), with P E ^ m (X) an 
operator with real principal symbol. Then WF u is a union of maximally 
extended null bicharacteristics of a m (P) in S*X. 

We should slightly clarify the usage here: to make sense of these null 
bicharacteristics, we should actually take the Hamilton vector field of 
the homogeneous version of the symbol, cr m (P); this is a homogeneous 
vector field, and its integral curves thus have well-defined projections 
onto S*X. If the Hamilton vector field should be "radial" at some point 
q E T*X, i.e. coincide with a multiple of the vector field £ • d$ there, 
then the projection of the integral curve through q is just a single point 
in S*X, and the theorem gives no further information about wavefront 
set at that point. 

For P = □, the theorem says that the wavefront set lies in the 
"light cone," and propagates forward and backward at unit speed 
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along geodesies. If we take the fundamental solution to the wave equa- 
tion 20 w = sin(tv^A/v / A)5 p , it is not hard to compute that in fact for 
small, nonzero time, 21 

WF« C N*{d(-,p) = \t\} = C; 

This is a generalization of Huygens's Principle, which tells us that in 
Rxl", the support of the fundamental solution is on this expanding 
sphere (but which is a highly unstable property). Note that C is in fact 
the bicharacteristic flowout of all covectors in E projecting to N*({p}) 
at t — 0, and under this interpretation, C C T*(R x X) makes sense 
for all times, not just for short time, regardless of the metric geometry. 
We shall return to and amplify this point of view in §9. 

Exercise 4.16. 

(1) Suppose that Du = on R x R n and u(t,x) G C°° for (t,x) G 
(— e,e) x 5(0,1) for some e > 0. Show, using Theorem 4.11, 
that u G C°° on < 1 — \ t\~\. Can you show this more directly 
using the energy methods described in §2.3? 

(2) Suppose that Uu = on R x R n and u(t,x) G C°° for (t,x) G 
(-e,e) x (B(0, l)\B(0, 1/2)) for some e > 0. Show, using the 
theorem, that u G C°° in {\x\ < 1 - \t\} n {\t\ G (3/4, 1)} 

Proof. 22 Note that we already know that WF-u C Sp by Theorem 4.6, 
hence what remains to be proved is the flow-invariance. 

Let g G Sp C S*X. By homogeneity of a m (P), we can write the 
Hamilton vector field in T*X in a neighborhood of q as 

(4.2) H p = \tr-\V + hR), 

where R denotes the radial vector field £ • d%, h is a function on S*X, 
and V is the pullback under quotient of a vector field on S*X itself, 
i.e. V is homogeneous of degree zero with no radial component, hence 
of the form J2j fj( x ^)9^ + gj(x,£)d x j. Note that if a is homogeneous 
of degree / then 

(4.3) Ra = la. 

(Exercise: Verify these consequences of homogeneity.) 



This is the spectral-theoretic way of writing the solution with initial value 
and initial time-derivative S p . 

21 Well, I am cheating a bit here, as we don't haven't stated any results allowing 
us to relate the wavefront set of Cauchy data for the wavefront set of the solution 
to the equation. To understand how to do this, you should read [17]. 

22 This proof is very close to those employed by Melrose in [17] and [18]. 
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By the comments above, we may take V ^ near q; otherwise the 
theorem is void. Thus, without loss of generality, we may employ a 
coordinate system a±, . . . , ai n -i for S*X in which 

(4.4) v = a Q1 , 

hence using a, |£| as coordinates in T*X, 

Hp = d ai + hR; 

we may shift coordinates so that a(q) = 0. We split the a variables 
into cci and a' = (a 2 , . . . , a n -i). 

Since WF u is closed, it suffices to prove the following: if q ^ WF u 
then $4(9) ^ WF« for t G [—1,1], where $ t denotes the flow gener- 
ated by V. 23 (This will show that the intersection of WF« with the 
bicharacteristic through q is both open and closed, hence is the whole 
thing.) 

We can make separate arguments for t G [0, 1] and t G [—1,0], and 
will do so (in fact, we will leave one case to the reader). 

For simplicity, let us take Pu = 0; we leave the case of an inhomoge- 
neous equation for the reader (it introduces extra terms, but no serious 
changes will in fact be necessary in the proof). 

Since WF-u is closed, our assumption that q WF« tells us that 
there is in fact a 25-neighborhood of in the a coordinates that is 
disjoint from WFu; we are trying to extend this regularity along the 
rest of the set (a±, a') G [0, 1] x 0. We proceed as follows: let 

(4.5) s = sup{s : WF S u n {(an, a) G [0,1] x B(0,5)} = 0}. 
Pick any s < s . We will show that in fact 

(4.6) WF S+1/2 u D {(ai, a') G [0, 1] x B(0, 5)} = 0, 

thus establishing that so = 00, which is the desired result (by Proposi- 
tion 4.10). One can regard this strategy as iteratively obtaining more 
and more regularity for u along the bicharacteristic (i.e. the idea is 
that we start by knowing some possibly very bad regularity, and we 
step by step conclude that we can improve upon this regularity, half a 
derivative at a time). More colloquially, the idea is that the "energy," 
as measured by testing the distribution u by pseudodifferential opera- 
tors, should be comparable at different points along the bicharacteristic 
curve. 



Of course, we are assuming here that the interval [—1,1] remains in our coor- 
dinate neighborhood; rescale the coordinates if necessary to make this so. 
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Now we prove the estimates that yield (4.6) via commutator meth- 
ods. Let 0(s) be a cutoff function with 

>0 an (-1,1), 
[ ' } supp0=[-l,l] 



Let (f)s{s) = </)(5~ 1 s); arrange that G Let x be a cutoff function 
equal to 1 on (0, 1) and with y/ = 4>i — 4 ) 2, with tpi supported on (—5, 5) 
and ip2 on (1 — 5, 1 + 5); we will further assume that v /%, G 

Exercise 4.17. Verify that cutoffs with these properties exist. 

In our coordinate system for S*X, let 

a = <MK|) X (aiK Aai eC°°(S*X), 

with A ^> to be chosen presently. Passing to the corresponding 



function on a G C°°(T*X) that is homogeneous of degree 2s 
we have 



m + 2, 



-1 ( - A0 5 (KlM"i)e 
+ <MKl)(^i-^ 2 )e- AQ 



(4.8) H p (a) = |er 

/i(a)(2s-m + 2)a) 

with h given by (4.2). Since a 25 coordinate neighborhood of the origin 
was assumed absent from WFu, we have in particular ensured that 
supped a' |)-?/>i(a:i) is contained in (WFw) c . We also have suppa C 
(WF s u) c by (4.5), since s < s . 





supp^i^ 



supp a 



SUpp ^ 2 <f>6 



-Aai 




FIGURE 1. The support of the commutant and its 
value along the line a' — 0. The support of the term 
ipi(ai)(ps(\o:'\) is arranged to be contained in the com- 
plement of WFu, while the support of the whole of a is 
arranged to be in the complement of WF S u. 
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Let A G i]/ 2s-m+2 (X) be given by the quantization of a. 24 Since 
o m (P) is real by assumption, we have P* — P G '§> m ~ 1 (X). (Exercise: 
Check this!) Thus the "commutator" P*A — AP, which is a priori of 
order 2s + 2, has vanishing principal symbol of order 2s + 2, hence it 
in fact lies in ty 2s+1 (X), and we may write 

(P*A - AP) = [P, A] + (P* - P)A, 

with 

(4.9) i<7 2i+1 ([P, A] + (P* - P)A) = H» + <7 m _!(P* - P)a 

= -A0,(|a / |)x(a 1 K Aai |e| 2s+1 + 5 (|« / |)(^i-^)e- Aai |e| 2s+1 

+ (ia m ^(P* -P) + h(a)(2s -m + 2))a, 

by (4. 2), (4. 3), and (4.4). If A ^> is chosen sufficiently large, we may 
absorb the third term into the first, and write the RHS of (4.9) as 

-f(a)M\o/\)xM + MWWi - ^)e~ Aai 

with / > on the support of <psX- 

Let B G ^( 2s+1 )/ 2 (X) be obtained by quantization of 

\t\ s+1 '\f(a)<j>s(\a'\)xM) 1/2 -, 
and let Q G ^( 2s+1 )/ 2 (X) be obtained by quantization of 
|er 1/2 (^(K|)^(a 1 )) 1 /VW2. 

Then by the symbol calculus, i.e. by Properties III, IV of the calculus 
of pseudodifferential operators, 

(4.10) i(P*A-AP) = i(P*-P)A+i[P,A] = -B* B+Cld-C* 2 C 2 +R 

with R G ty 2s (X), hence of lower order than the other terms; moreover 
we have WF' R C supp a. 

Now we "pair" both sides of (4.10) with our solution u. We have 

i((P*A - AP)u,u) = ((—B*B + C*d - C* 2 C 2 + R)u,u); 

as we are taking Pu = 0, the LHS vanishes. 25 We thus have, rearrang- 
ing this equation, 

(4.11) \\Bu\\ 2 + \\C 2 u\\ 2 = \\C lU \\ 2 + (Ru,u). 

24 I.e., really A is given by cutting off a near £ = to give a smooth total symbol 
and quantizing that. 

25 In the case of an inhomogeneous equation, it is of course here that extra terms 
arise. 
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I claim that the RHS is finite: Recall that R lies in \l/ 2s (X). Let A be 
an operator of order — s, elliptic on WF' it! and with WF'A contained 
in the complement of WF S u. 

Exercise 4.18. Show that such a A exists. 

Thus, letting T be a microlocal parametrix for A on WF' R, we have 

WF'i2nWF'(Id-AT) = 0, 

hence 

R — ATR = E E V-°°(X). 

Thus, 

\(Ru,u)\ < | (TRu, A*u) | + \{Eu,u)\ < oo 
by Corollary 4.9 since WF' TR U WF' A* C (WF s n) c (and since E is 
smoothing). Returning to (4.11), we also note that the term ||Ci-u|| 2 
is finite by our assumptions on the location of WF S+1 ^ 2 u (and another 
use of Corollary (4.9)). Thus, 

\\Bu\\ < oo, 

and consequently, 

WF S+1/2 uDeWB = 0, 
which was the desired estimate. □ 

Exercise 4.19. Now see how the argument should be modified to yield 
absence of WF S+1/2 u on 

{a e [-1,0], a' = 0}. 

One cheap alternative to going through the whole proof might be to 
notice that we also have (— P)u G and that H„ p = — H p ; thus, the 
"forward propagation" that we have just proved should yield backward 
propagation along H p as well. 

The fine print: Now, having done all that, note that it was a cheat. 
In particular, we didn't know a priori that we could apply any of the 
operators that we used to u and obtain an L 2 function, let alone jus- 
tify the formal integrations by parts used to move adjoints across the 
pairings. Therefore, to make the above argument rigorous, we need 
to modify it with an approximation argument. This is similar to the 
situation in Exercise 2.7, except in that case, we had a natural way 
of obtaining smooth solutions to the equation which approximated the 
desired one: we could replace our initial data ipo for the Schrodinger 
equation by, for instance, e~ eA ipo] the solution at later time is then just 
e~ eA ip, and we can consider the limit e I 0. In the general case to which 
this theorem applies, though, we do not have any convenient families of 
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smoothing operators commuting with P. So we instead take the tack of 
smoothing our operators rather than the solution u. We should manu- 
facture a family of smoothing operators G € that strongly approach the 
identity as e 4, 0, and replace A by AG f everywhere it appears above. 
If we do this sensibly, then the analogs of the estimates proved above 
yield the desired estimates in the e I limit. Of course, we need to 
know how G t passes through commutators, etc., so the right thing to 
do is to take the G e themselves to be pseudodifferential approximations 
of the identity, something like 

G e = Op e (<p(e\t\)) 

on R n , with <p G C£°(R) a cutoff equal to 1 near 0. We content ourselves 
with referring the interested reader to [18] for the analogous develop- 
ment in the "scattering calculus" including details of the approximation 
argument. 

Exercise 4.20. 

(1) Show the following variant of Theorem 4.11: if P G 4> m (X) is 
an operator with real principal symbol, and Pu G C°°(X), show 
that WF fc u is a union of maximally extended bicharacteristics 
of P for each k G R. (Hint: the proof is a subset of the proof of 
Theorem 4.11.) 

(2) Show the following inhomogeneous variant of Theorem 4.11: if 
P G ty m (X) is an operator with real principal symbol, and Pu = 
/, show that WF u\ WF / is a union of maximally extended 
bicharacteristics of P. 

Exercise 4.21. 

(1) What does Theorem 4.11 tell us about solutions to the Schrodinger 
equation? (Hint: not much.) 

(2) Nonetheless: let ip(t,x) be a solution to the Schrodinger equa- 
tion on M x X with (X, g) a Riemannian manifold; suppose that 
ip(0,x) =^ 6 H l ' 2 {X). Define a set Si C S*X by 

q <£ S 1 there exists A G ^(X), q G ell(A), 

f 1 2 
such that / \\Aip\\ dt < 00. 

Jo 

(In other words, S\ is a kind of wavefront set measuring where 
in the phase space S*X we have ip G L 2 ([0, 1]; if^JT)) — cf. 
Exercise 2.7.) 

Show that Si is invariant under the geodesic flow on S*X. 
(See Exercise 4.14 for the definition of geodesic flow.) 
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(Hint: use (2.2) with A an appropriately chosen pseudodif- 
ferential operator of order zero, constructed much like the ones 
used in proving Theorem 4.11.) 

Reflect on the following interpretation: "propagation of L 2 H l 
regularity for the Schrodinger equation occurs at infinite speed 
along geodesies." 

5. Traces 

It turns out to be of considerable interest in spectral geometry to 
consider the traces of operators manufactured from A, the Laplace- 
Beltrami operator on a compact 26 Riemannian manifold. The famous 
question posed by Kac [15], "Can one hear the shape of a drum," has a 
natural extension to this context: Recall from Exercise 3.11 that there 
exists an orthonormal basis <f>j of eigenfunctions of A with eigenvalues 
A 2 — > +oo; what, one wonders, can one recover of the geometry of a 
Riemannian manifold from the sequence of frequencies A.,? Using PDE 
methods to understand traces of functions of the Laplacian has led to 
a better understanding of these inverse spectral problems. 

Recall from Proposition 3.5 that \/A is a first-order pseudo differen- 
tial operator on X. It is a slightly inconvenient fact that while \/~K e 
^ 1 (X), y/A ^ ^(I x I) : its Schwartz kernel is easily seen to be 
singular away from the diagonal. But this turns out be be of little 
practical importance for our considerations here: it is close enough! 

Let us now consider the operator 

(5.1) U(t) = e~ itVE 

which can be defined by the functional calculus to act as the scalar 
operator e~ ltXj on each <f>j. U(t) is unitary, and indeed is the solution 
operator to the Cauchy problem for the equation 

(5.2) (d t + iVA)u = 0; 
that is to say, if u = U (t)f, we have 

{d t + iVA)u = 0, and u(0,x) = f(x). 

Equation (5.2) is easily seen to be very closely related to the wave 
equation: if u solves (5.2) then applying d t — iv^A, we see that u also 
satisfies the wave equation. Of course, (5.2) only requires a single 



We especially emphasize that X denotes a compact manifold throughout this 
section. 
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Cauchy datum, unlike the wave equation, so the trade-off is that the 
Cauchy data of a as a solution to Du = are constrained: we have 

u(0, x) = f(x), d t u(0, x) = -iy/Af. 

The real and imaginary parts of the operator U (t) are exactly the solu- 
tion operators to the (more usual) Cauchy problem for the wave equa- 
tion with u(0, x) = f(x), d t u(0, x) — and with u(0, x) = 0, d t u(0, x) = 
—iy/~Kf(x) respectively 

Why is the operator U(t) of interest? Well, suppose that we are 
interested in the sequence of A/s. It makes sense to combine these 
numbers into a generating function, and certainly one option would be 
to take the exponential sum 27 




j 



This is, at least formally, nothing but the trace of the operator U(t). 
One of the principal virtues of this generating function is that if we let 
N(X) denote the "counting function" 

N(X) = #{A, < A}, 

then we have 

1V(A) = 

3 

hence 

^e^' = (27r)"/ 2 J- A ^(iV / (A))(t). 

This is all a bit optimistic, as U(t) is easily seen to be not of trace 
class — for example at t — it is the identity. So we should try and 
think of Tr U (t) as a distribution. We do know that for any test function 
(p(t) E 5(R) and any / G L 2 {X), 

J <p(t)U(t)fdt = J {I + D 2 t )- k {\ + D 2 t ) k {y{t))U{t)f dt 
(5.3) = J(l + D 2 n V (t))(l + D 2 t y k U(t)fdt 

= J(i+D 2 ) k ^(t))(i+^r k u(t)fdt, 

27 This choice of generating function, corresponding to taking the wave trace, is 
of course one choice among many. Some other approaches include taking the trace 
of the complex powers of the Laplacian or the heat trace. The idea of using (at 
least some version of) the wave trace originates with Levitan and Avakumovic. 
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since D\ U = AU. Here we can, if we like, consider (1 + A)~ k to be 
defined by the functional calculus; it is in fact pseudodifferential, of 
order —2k. We easily obtain (using either point of view) the estimate: 

(1 + A)- k U(t) : L 2 (X) -> H 2k (X); 

hence, for fc>0, the operator (1 + A)~ k U(t) is of trace class. 

Exercise 5.1 . Prove that this operator is of trace class for k 3> 0. (Hint: 
One easy route is to think about first choosing k large enough that the 
Schwartz kernel is continuous, hence the operator is Hilbert-Schmidt; 
then you can take k even larger to get a trace-class operator, by factor- 
ing into a product of two Hilbert-Schmidt operators (see Appendix).) 

Equation (5.3) thus establishes that 

Tr U(t) : up ^ Tr J (p(t)U(t) dt 

makes sense as a distribution on R. We can thus write 
(5.4) Tr U(t) = {2ir) n ' 2 F{N'){t). 

where both sides are defined as distributions. Our next goal is to try 
to understand the left side of this equality through PDE methods. 

Exercise 5.2. Show that if the Schwartz kernel K(x,y) of a bounded, 
normal operator T on L 2 (X) is in C k (X) for sufficiently large k, then 
T is of trace-class and 

TrT = J K(x,x)dg(x). 

(Hint: Check that K is trace-class as in the previous exercise. Then 
apply the spectral theorem for compact normal operators, and use the 
basis of eigenfunctions of K when computing the trace. The crucial 
thing to check is that if ipj are the eigenfunctions, then 

the delta-distribution at the diagonal, since this is nothing but a spec- 
tral resolution of the identity operator.) 

As a consequence of Exercise 5.2, we can compute the distribution 
Tr U(t) in another way if we can compute the Schwartz kernel of U(t). 
Indeed, knowing even rather crude things about U (t) can give us some 
useful information here. 

Theorem 5.1. Let $ t be the geodesic flow, i.e. the flow generated by 
the Hamilton vector field of |£| = g^iiij) 1 ^ 2 ■ Then 

WFU(t)f = $ t (WFf). 
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We begin with a lemma: 
Lemma 5.2. Let (d t + iyfK)u = 0. Then 

(x ,So) E WFu\ t=t0 

if and only if 

(t = t ,r = -|£oUo,£o) e WFm. 

Proof. 28 Suppose g = (xoj£o) £ WFw| t=to . Since g = (t = t ,r = 

— ICol) ^0) £o) is the only vector in £ 9t+iv /^ that projects to (:r ,£ ), it 
must lie in the wavefront set of u by Exercise 4.11. 

The converse is harder. Suppose q £ WFu| t=to . Let v = H(t — t )u, 
with H denoting the Heaviside function. Then 

(dt + iVA)v = S(t - t )u(t , x) = f. 

and v vanishes identically for t < t . By the last part of Exercise 4.11, 

q$WFf, 

hence (since WF / only lies over t = t ) certainly no points along the 
bicharacteristic through q lie in WF /. Moreover, no points along this 
bicharacteristic lie in WFt> for t < to (since v is in fact zero there). 
Hence by the version of the propagation of singularities in the second 
part of Exercise 4.20, this bicharacteristic is absent from WFm. In 
particular, q ^ WFm. □ 

Theorem 5.1 now follows directly 29 from the lemma and Theorem 4.11. 
We now require a result on microlocal partitions of unity somewhat 
generalizing Lemma 4.1: 

Exercise 5.3. Let pj, j — 1, . . . , N be a smooth partition of unity for 
S*X. Show that there exists A s G with WF'A, = supppj, 

a (Aj) = pj, A* = Aj, and 

N 

with R e 

28 I am grateful to Andras Vasy for showing me this proof. 

29 Here is one of the places where we should worry about the fact that VA is not 
a pseudodifferential operator on R x X. This problem is seen not to affect the proof 
of Hormander's theorem if we note that composing ^/A with a pseudodifferential 
operator that is microsupported in a neighborhood of the characteristic set {|r| = 

— yields an operator that is pseudodifferential, and that the symbol calculus 
extends to such compositions. (The author confesses that this is not entirely a 
trivial matter.) 
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For a distribution u, let singsupp-u (the "singular support" of u) be 
the projection of its wavefront set, i.e. the complement of the largest 
open set on which it is in C°°. 

Theorem 5.3. 

singsupp Tr U(t) C {0} U {lengths of closed geodesies on X}. 

This theorem is due to Chazarain and to Duistermaat-Guillemin. 
We begin with the following dynamical result: 

Lemma 5.4. Let L not be the length of any closed geodesic. Then 
there exists e > and a cover Ui of S*X by open sets such that for 
t G (L — e, L + e), there exists no geodesic with start- and endpoints 
both contained in the same Ui. 

Exercise 5.4. 

(1) Prove the lemma. (Hint: The cosphere bundle is compact.) 

(2) As long as you're at it, show that is an isolated point in the 
set of lengths of closed geodesies ("length spectrum"), and that 
the length spectrum is a closed set. 

We now prove Theorem 5.3. 

Proof. Let L not be the length of any closed geodesic on X. Let Uj be 
a cover of S*X as given by Lemma 5.4. Let pj be a partition of unity 
subordinate to Uj and let Aj be a microlocal partition of unity as in 
Exercise 5.3. Then, calculating with distributions on R 1 , we have 

Tr U(t) = Tr A2 j u (t) + Tr RU (t) 

j 

= Tr AjU(t)Aj + Tr RU(t) 

j 

and, more generally, 

D\ m Tr U(t) = J2 Tl A i & m U(t)Aj + Tr RA m U(t). 

3 

Let u be a distribution on X; then WFA,-w C WF'A, C Uj. Thus 
Theorem 5.1 gives 

WF A m U(t)A jU C & t (Uj). 

But by construction, this set is disjoint from Uj and hence from WF' Aj. 
Hence for any m, 30 

AjA m U(t)Aj e L°°([L - e, L + e]; * _0O (X)); 

30 We technically have to work just a little to obtain the uniformity in time: 
observe that Aj A m U(t)Aj are a continuous (or even smooth) family of smoothing 
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consequently, 



D 2 t m TrU(t) e L°°([L-e,L + e]). □ 



Exercise 5.5. Show that in the special case of X = S 1 , Theorem 5.3 
can be deduced from the Poisson summation formula. For this reason 
it is often referred to as the Poisson relation. 

One is tempted to conclude from (5.4) and Theorem 5.3 that one can 
"hear" the lengths of closed geodesies on a manifold, since the right side 
of (5.4) is determined by the spectrum, and the left side seems to be 
a distribution from whose singularities we can read off the lengths of 
closed geodesies. The trouble with this approach is that we do not know 
with any certainty from Theorem 5.3 that the putative singularities in 
Tr U (t) at lengths of closed geodesies are actually there: perhaps the 
distribution is, after all, miraculously smooth. Thus, proving actual 
inverse spectral results requires somewhat more care, as we shall see. To 
this end, we will begin studying the operator U (t) more constructively 
in the following section. 



In order to learn more about the wave trace, we will have to bite 
the bullet and construct an approximation ( "parametrix" ) for the fun- 
damental solution to the wave equation on a manifold. The approach 
will have a similar iterative flavor to the technique we used to construct 
an approximate inverse for an elliptic operator, but we have now left 
the comfortable world of pseudodifferential operators: the parametrix 
we construct is going to be something rather different. Exactly what, 
and how to systematize the kinds of calculation we do here, will be 
discussed later on. 

As this construction will be local, we will work in a single coordi- 
nate patch, which we identify with IR n ; for the sake of exposition, we 
omit the coordinate maps and partitions of unity necessary to glue this 
construction into a Riemannian manifold. 

Consider once again the "half- wave equation" 31 



on M™, where A is the Laplace-Beltrami operator with respect to a 
metric g. Our goal is to find a distribution u approximately solving 

operators. We have been avoiding the topological issues necessary to easily dispose 
of such matters, however. 

31 Remember that D t = i~ 1 d t . 



6. A PARAMETRIX FOR THE WAVE OPERATOR 
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(6.1) with initial data 

u(0,x,y) = 8(x - y) 

for any y e M n . Recall that if we let U denote the exact solution to 

(6.1) with initial data S(x — y) then U can also be interpreted as (the 
Schwartz kernel of) the "solution operator" mapping initial data / to 
the solution e~ lt ^f with that initial data, evaluated at time t; this is 
why we denote it U, as we did above, and why we will often think of 
our parametrix u(t, x, y) as a family in t of integral kernels of operators 
on R n . 

We do not expect U (t, x, y) or our parametrix for it to be the Schwartz 
kernel of a pseudodifferential operator, as it moves wavefront set around, 
by Theorem 4.11; recall that pseudodifferential operators are microfo- 
cal, which is to say they don't do that. But we will try and construct 
our parametrix u(t, x, y) as something of roughly the same form, which 
is to say as an oscillatory integral 

v.{t,x,y) = J a{t,x,n)e' lt dr) 

where the main difference is that the "phase function" $ = <E>(i, x, y, rj) 
will be something a good deal more interesting than (x — y) ■ rj; indeed, 
this phase function is where all the geometry of the problem turns out 
to reside. 

First, let's write our initial data as an oscillatory integral: 

5(x-y) = (2n)~ n J e^'^ dr). 

Let us now try, as an Ansatz, modifying the phase as it varies in t, x 
by setting 

(6.2) u(t, x, y) = (2n)~ n J a(t, x, ^e^^'^ dr]; 

then if (f)(0,x,r)) = x ■ i] and a(0,x,r)) = 1, we recover our initial data; 
moreover, if were to remain unchanged as t varied we would have 
nothing but a family of pseudodifferential operators. Let us assume 
that a is a classical symbol of order in 77, so that we have an asymptotic 
expansion 

a ~ oq + 1 a_i + \i]\~ 2 a^2 + • • • , a j = a j{t, x, fj). 

Let us further assume that is homogeneous in 77 of degree 1, hence 
matches the homogeneity 32 of x • rj. 



d That is is then likely to be singular at 77 = will not in fact concern us, as it 
will turn out that we may as well assume that a vanishes near 77 = 0. 
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Now if u solves the half-wave equation, it solves the wave equation, 
hence we have 

Du = 0; 

As we seek an approximate solution, we will instead accept 

□ueC°°((-e,e) t xR"). 

Our strategy is to plug (6.2) into this equation and see what is forced 
upon us. To this end, note that if we have an expression 

(6.3) v = (2vr)- n J b(t, x, y, ^e^^-^ dr}\ 

where b is a symbol of order — oo, then v lies in as the integral 
converges absolutely, together with all its t, x, y derivatives. So terms 
of this form will be acceptable errors. 

Applying □ to (6.2), we group terms according to their order in rj. 
The "worst case" terms involve factors of r] 2 , and can only be pro- 
duced by second-order terms in □, with all derivatives falling on the 
exponential term. Since the second-order terms in A are just 



we can write the term this produces from the phase as |d x 0|g or > equiv- 
alently, |V X 0|^. Thus, the equation that we need to solve to make the 
rf terms vanish is just 

(6.4) {dt<P) 2 - \V x ct>\ 2 g = 0. 

Recall that we further want our phase to agree with the standard pseu- 
dodifferential one at time zero, i.e. we want 

(6.5) 0(0, x, rj) = x ■ rj. 

Combining this information with (6.4) we easily see that we in partic- 
ular have 

W| t=0 ) 2 = \v\l 

and we need to make an arbitrary choice of sign in solving this to get 
the initial time-derivative: we will choose 33 

(6-6) d t (f)\ t= o = -\r)\ g . 

If our metric is the Euclidean metric, we can easily solve (6.4), (6.5), 
and (6.6) by setting 

4>(t, x,rj) = x ■ rj — t\rj\. 



33 We will use this solution for reasons that will become apparent presently — it 
is the right one to solve (5.2) and not merely the wave equation. 
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More generally, the construction of a phase satisfying (6. 4), (6. 5) and 

(6.6) is the classic construction of Hamilton- Jacobi theory, and is sketched 
in the following exercise. 

Exercise 6.1. 

(1) Show that equation (6.4) is equivalent to the statement that for 
each T), the graph of d tjX (f>(t, x, rj) is contained in the set 

A = {r 2 -|^ = 0}CT*(R, xR3 

(where the variables r and £ are the canonical dual variables to 
t and x respectively). The condition (6.5) implies 

d x (j)(t,x,rj)\ t= o = rj-dx. 

Equation (6.6) gives further 

(6.7) d t:X (j)(t : x,rj)\ t= o = -\rj\ dt + rj ■ dx; 
accordingly, for fixed rj, let 

G = {t = 0,x G R n ,r = -\ri\,£ = ri} C T*(R x R n ). 

(2) Let H denote the Hamilton vector field of r 2 — \^\ 2 g . Show that 
flow along H preserves A and that H is transverse to Gq. 

(3) Show that there is a solution to (6. 4), (6. 7) for t G (— e, e) where 
the graph of d ttX <fi is given by flowing out the set Go under 
H. (Among other things, you need to check that the resulting 
smooth manifold is indeed the graph of the differential of a 
function.) Show that this solution can be integrated to give a 
solution to (6. 4), (6. 5). 

Employing the phase constructed in Exercise 6.1, we have now 
solved away the homogeneous degree-two (in rj) terms in the application 
of □ to our parametrix. We thus move on to the degree-one terms, 
which are as follows: 

(6.8) 2D t (f>D t ao - 2{D x <j>, D x } g a + n(t, x, y, r,) 

where r\ is a homogeneous function of degree 1 independent of a , i.e. 
determined completely by 0. Given that solves the eikonal equation, 
we can rewrite (6.8) by factoring out \W x (f>\ and noting that our sign 
choice d t (f> = — \V x <f>\ must persist away from t — (for a short time, 
anyway). In this way we obtain 

2d t a + 2 ( iy 3! f| J 9 *) °o - h = 0, 

I Xy'l g 9 

with f\ homogeneous of degree 0. This is a transport equation that we 
would like to solve, with the initial condition a (0, x, y, rj) = 1 (the 
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symbol of the identity operator). We can easily see that a solution 
exists with the desired initial condition a (0,y,i]) = 1, as, letting 



|V X 0| 



9 



9 



we see that H is a nonvanishing vector field, transverse to t — 0, hence 
we may solve 

Ha = ri, a | t =o = 1 
by standard ODE methods. 

Now we consider degree-zero terms in rj. We find that they are of the 
form 

2£> t 0Aa-i - 2(D x (/>, D x ) g a-i + r (t, x, y, rf) 

where r only depends on ao and <fi (i.e. not on a_i). Thus, we may use 
the same procedure as above to find a_i with initial value zero, making 
the degree-zero term vanish. (Note that the vector field H along which 
we need to flow remains the same as in the previous step.) 

We continue in this manner, solving successive transport equations 
along the flow of H so as to drive down the order in r] of the error term. 
Finally we Borel sum the resulting symbols, obtaining a symbol 

such that 

a(0,x,rj) = 1, 

and 

(6,) □^□(( 2 ,)-»/ a(t ,„)e-™ *; 

= (2rr)- n J bitiX^y^y^rt-yv) dr] E C°°((-e,e) x X), 
since b e S~°°. 

Now we need to check that (6.9) implies that in fact u differs by a 
smooth term from the actual solution. We will show soon (in the next 
section) that our choice of the phase implies that 34 WF« C {r < 0}. 
Hence, using this fact, we have 

(6.10) (d t -iVA)(d t + iv^)u = f GC°°; 

Now d t — iV~K is elliptic on r < 0, so, letting Q denote a microlocal 
elliptic parametrix, we have 

Q(d t - iVA) =I + E 



This can also be verified directly, with localization, Fourier transform, and 
elbow grease. 
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with WF' E n WFm = 0. Thus, applying Q to both sides of (6.10), we 
have 

(d t + iVK)u e C°°. 

Also, as we have arranged that a(0, x,rj) = 1, we have got our initial 
data exactly right: u(0, x,y) = 5(x — y). Letting U denote the actual 
solution operator to (5.2), we thus find 

(d t + iVA)(u - U) eC°°, u(0,x,y)-U(0,x,y) = 0; 
hence by global energy estimates 35 we have 

u-U eC°°((-e,e) xl"). 

7. The wave trace 

Our treatment of this material (and, in part, that of the previous 
section) closely follows the treatment in [7], which is in turn based on 
work of Hormander [9] . 

Recall that, if N(X) = #{A^ < A} and U(t) is given by (5.1), then 

(7.1) Tr U(t) = (2n) n/2 F(N'(\)). 

Thus, the singularities of Tr U(t) are related to the growth of N(X). 
We think that Tr U (t) should have singularities at zero, together with 
lengths of closed geodesies; since U(Q) is the identity (which has a very 
divergent trace), the singularity at t — 0, at least, seems certain to 
appear. We will thus spend some time discussing this singularity of 
the wave trace and its consequences for spectral geometry. 

What is the form of the singularity of Tr U(t) at t = 0? Our 
parametrix from the previous section was 

u(t,x,y) = (2n)- n J a(t, x, rfijWvri-vn) d v , 
where 4>(t,x,rj) = x-r] — t\r]\ g ^ + 0(t 2 ), and a(t,x,rj) = l + 0(t). Thus, 

(7.2) u(t, x, x) = (2tt)-" J a(t, x, ^e^U^ ^^ dr}, 

where we have used the homogeneity of the phase in writing the error 
term as 0(t 2 |?7|). 

Formally, we would now like to conclude that the singularity at t — 
is approximately that of 

u(t,x,x) = (2n)- n [ e- itM ^)dr] 



We can either use the estimates developed in §2.3, adapted to this variable 
coefficient setting, and with a power of the Laplacian applied to the solution (in 
order to gain derivatives); or we can apply Theorem 4.11, which is overkill. 
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so that integrating in x would give, if all goes well, 

™<«> -/«<«.*.*)«<* 

~ {2it)- n I I e~ u \<dr]dx 

(7.3) 

= (2tt)-" III e -ita\e\ g(J n-l d(J dQ dx 

J J J a>o,\e\=i 



(2tt)-"/ 2 J J ' T{a n -'H{a)){t\9\ g )dedx, 



with H denoting the Heaviside function. (Recall that the notation 
/ ~ g means that (/ / g) — > 1, in this case as t — > 0.) If we crudely 
try to solve (7.1) for N'(X) by applying an inverse Fourier transform 
to Tr U (t) and pretending that the singularity of Tr U(t) at t — is all 
that matters, we find, formally, that (7.3) yields 

N>(\)~(2n)-^T-_l x TrU(t) 

~(27r)- II ^(^-Y^dedx 
JJ\e\=i 

= (27r)-"A ri - 1 / / \9\~ n d9dx. 



Integrating would formally yield 

N(X) ~ (2ir)- n — II \6r n d6dx 
n JJ\e\=i 




(27r)" n A n / / / \6\ g n p dp dQ dx 



= (27T)" n A n / / / a^dadQdx, 

J J J\a8\ g <l 

where we have, in the last line, set a = p/\0\ g , with the result that 
definition of the region of integration now involves the metric. This 
last quantity can easily be seen to be simply the volume in phase space 
of the set |£| < 1, otherwise known as the unit ball bundle. 36 Thus, 
we obtain formally 

N(X) ~ (27r)- n A n Vol(5*X) = (27r)- n Vol({|£L < A}). 



36 Recall that on a symplectic manifold (N 2n ,io) we have a naturally defined 
volume form u> n , and it is this volume that we are integrating over the unit ball 
here. 
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This is all nonsense, of course, for several different reasons. First, we 
were very imprecise about dropping higher order terms in t in comput- 
ing the asymptotics of the trace as t — > 0. Furthermore, we formally 
computed with N' as if it were a smooth function, but of course N' 
is quite singular (a sum of delta distributions). Moreover, and poten- 
tially most seriously, there are in general infinitely many singularities 
in Tr U (t) that might be contributing to the asymptotic behavior of its 
Fourier transform: we have been concerning ourselves only with the 
one near t — 0. However: the above argument does give the right lead- 
ing order asymptotics, the so-called "Weyl Law." What follows is (the 
outline of) a rigorous version of the above argument. 

To begin, we need a cutoff function to localize us near the singularity 
at t — 0, where our parametrix is valid. 

Exercise 7.1. Show that there exists p G S(M) with p compactly sup- 
ported, p(0) = 1, p(t) = p(—t), p(X) > for all A, and p supported in an 
arbitrarily small neighborhood of 0. (Hint: Start with a smooth, com- 
pactly supported p; convolve with its complex conjugate, and scale.) 

We now consider 



J-\(p(i)Tru(i)) 

= (27i)- n -^ 2 JJ I p(t)a(t,x,r ] )e i{t ^< )+0(t2M)) dxdr ] dt 

= (2n)- n -^ 2 [[[ p(t)a(t,x,Xa9)e it ^ 1 - a+o ^\Xa) n - 1 dxdad0dt ] 



here we have used the change of variables r] = XaO with \6\ = 1. We 
now employ the method of stationary phase to estimate the asymptotics 
of the integral in t, a. If p is chosen supported sufficiently close to the 
origin, then the unique stationary point on the support of the amplitude 
is at a — 1, t — 0; we thus obtain a complete asymptotic expansion in 
A beginning with the terms 



Exercise* 7.2. Do this stationary phase computation. If you don't 
know about the method of stationary phase, this is your chance to 
learn it, e.g. from [11]. 




AX 71 ' 1 + 0{X n - 2 ) 



where 



A = n(2n)- n Vo\(B*X). 



Thus, since u-U E C°°((-e,e) x R n ), (7.1) yields 
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Proposition 7.1. 

(p * N')(X) ~ AA™ -1 + 0(A n ~ 2 ). 

We now try to make a "Tauberian" argument to extract the desired 
asymptotics of N(X) from this estimate. 

Lemma 7.2. 

N(\ + 1)-N(\) = 0(\ n - 1 ). 
Proof. By Proposition 7.1 and since iV'(A) = Yl ~~ -\j)> we nave 

^p(A-A,)~AA"- 1 + 0(A^ 2 ); 
thus, by positivity of p(A), 

( inf ' p) (#{A, : A - 1< A, < A + 1}) < ]Tp(A - A,) = 0(A™~ 1 ), 

and the estimate follows as the infimum is strictly positive. □ 

This yields at least a crude estimate: 
Corollary 7.3. 

N(X) = 0(X n ). 
A more technically useful result is: 
Corollary 7.4. 

N(X-T)-N(X)<(r) n (\) n -\ 

Exercise 7.3. Prove the corollaries. (For the latter, begin with the 
intermediate estimate (r)(|A| + |r|)" _1 .) 

Now we work harder. 

Exercise 7.4. Show that we can antidifferentiate the convolution to get 

f (p*N')(p)dp = (p*N)(X). 

J — oo 

As a result, we of course have 

(p * N){\) = AX n /n + 0(A n_1 ) = BX n + 0(A n_1 ) 

where B = A/n = (2n)- n Vo\(B*X). 
Thus, since f p(p) dp — 1, 

iV(A) = (AT * p)(A) - J (N(X - p) - N(X))p(p) dp 

= BX n + 0(A n_1 ) - J 0((p) n (A)^ 1 )p(p) rfp 
= BX n + 0{X n ~ 1 ), 



MICROLOCAL ANALYSIS 



55 



where we have used Corollary 7.4 in the penultimate equality. We 
record what we have now obtained as a theorem, better known as 
Weyl's law with remainder term. This form of the remainder term is 
sharp, and not so easy to obtain by other means. 

Theorem 7.5. 

N(X) = (27r)- n Vol(5*X)A" + 0(A n ~ 1 ). 

As noted above, it is perhaps suggestive to view the main term as 
the volume of the sublevel set in phase space {(x,(,) : <t(A)(x,£) < 
A 2 }. Weyl's law is one of the most beautiful instances of the quantum- 
classical correspondence, in which we can deduce something about a 
quantum quantity (the counting function for eigenvalues, also known 
as energy levels) in terms of a classical quantity, in this case the volume 
of a region of phase space. 

Exercise* 7.5. Show that the error term in Weyl's law is sharp on 
spheres. 

8. Lagrangian distributions 

The form of the parametrix that we used for the wave equation turns 
out to be a special case of a very general and powerful class of distri- 
butions, known as Lagrangian distributions, introduced by Hormander. 
Here we will give a very sketchy introduction to the general theory of 
Lagrangian distributions, and see both how it systematizes and extends 
our parametrix construction for the wave equation and how (in princi- 
ple, at least) it can be made to yield the Duistermaat-Guillemin trace 
formula, which gives us an explicit description of the singularities of 
the wave trace. 

We begin with a special case of the theory. 

8.1. Conormal distributions. Let X be a smooth manifold of di- 
mension n and let Y be a submanifold of codimension k. The conormal 
distributions with respect to Y are a special class of distributions hav- 
ing wavefront set 3 in the conormal bundle of Y, N*Y. Let us suppose 
that Y is locally cut out by defining functions pi, . . . , p k G C°°(X), i.e. 
that (at least locally), {pi = ■ ■ ■ = pk = 0} = Y, and dp±, . . . , dpk are 
linearly independent on Y. Then we may (locally) extend the p/s to a 
complete coordinate system 

(x 1 ,...x k ,y 1 ,...y n - k ) 



Recall that we have defined the wavefront set to lie in S*X but it is often 
convenient to regard it as a conic subset of T*X\o, with o denoting the zero-section. 
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with 

Xi = pi, . . . ,X k = Pfc, 

so that Y = {x = 0}. In these coordinates, how might we write down 
some distributions with wavefront set lying only in N*Y? Well, we can 
try to make things that are singular in the x variables at x — 0, with the 
y's behaving like smooth parameters. How do we create singularities 
at x = 0? One very nice answer is in the following: 

Lemma 8.1. Let a(£) G S%(M%) for some m. Then WFJ _1 (a) C 
iV*({0}). 

Proof. Writing 

^(a)(x) = (27T)"*/ 2 J a(Z)e**d£, 
we first note that 

J^\a)(x) G H- m ~ k ' 2 ~ e {R k ) 
for any a G and for all e > 0. Moreover for all j, 

(x l D x] )F~\a)(x) = (27t)~ k/2 J aiOU'lK )e^ x dx 
= (27r)- fc / 2 | ^a(£)e^^ 
= (2*)-*/* J frm^** d£ 

where we have integrated by parts in the final line. Note that if a G S 1 ™ 
then D^ja^)) G too (cf. Exercise 3.4). Thus we also have 

(x l D x ,)J-\a)(x) G H- m ~ k/2 ~ e (R k ). 

Iterating this argument gives 

(8.1) (x h D Xji ) . . . {x k D Xj y-\a){x) G H' m ~ k ^(R k ). 

for all choices of indices and all I G N. Thus T~ l a is smooth 38 away 
from x — 0. □ 

By the same token, we have more generally, 



38 We are of course proving more than the lemma states here: (8.1) gives a more 
precise "conormality" estimate that is valid uniformly across the origin. 
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Proposition 8.2. Let p±, . . . , pk be (local) defining functions for Y C 
X and let 

(8.2) a e S™ +{n ~ 2k)l \W x x R*) 
be compactly supported in x. Then 

(8.3) u(x) = (27r)- (n+2fc)/4 f a(x,6)e i ^ ei+ - +pA) de 

has wavefront set contained in N*Y. Moreover there exists s6i such 
that ifVi,...Vi are vector fields tangent to Y, then 

V 1 ...V l ue H s . 

Exercise 8.1. Prove the proposition. You will probably find it helpful 
to change to a coordinate system (x\, . . . , Xk, yi, ■ ■ ■ , y n -k) in which 
xi, . . . , Xk = pi,...,pk- Note that in this coordinate system, any vector 
field tangent to Y can be written 

^2a ij (x,y)x l d xj + s ^b j (x,y)d y] . 

What values of s, the Sobolev exponent in the proposition, are al- 
lowable? 

Definition 8.3. A distribution u G V(X) is a conormal distribution 
with respect to Y, of order m, if it can (locally) be written in the form 
(8.3) with symbol as in (8.2). 

While it may appear that the definition of conormal distributions 
depends on the choice of the defining functions pj, this is in fact not 
the case. The rather peculiar-looking convention on the orders of dis- 
tributions is not supposed to make much sense just yet. 

Note that examples of conormal distributions include 5(x) E W 1 
(conormal with respect to the origin), and more generally, delta dis- 
tributions along submanifolds. Also quite pertinent is the example 
of pseudodifferential operators: if A = Op^(a) e ^/ m (X) then the 
Schwartz kernel of A is a conormal distribution with respect to the 
diagonal in X x X, of order m. (This goes at least some of the way to 
explaining the convention on orders.) Indeed, we could (at some ped- 
agogical cost) simply have introduced conormal distributions and then 
used the notion to define the Schwartz kernels of pseudodifferential 
operators in the first place. 

8.2. Lagrangian distributions. We now introduce a powerful gen- 
eralization of conormal distributions, the class of Lagrangian distribu- 
tions. 39 We begin by introducing some underlying geometric notions. 



'These were first studied by Hormander [10]. 
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An important notion from symplectic geometry is that of a La- 
grangian submanifold £ of a symplectic manifold N 2n . This is a sub- 
manifold of dimension n on which the symplectic form vanishes. We 
can always find local coordinates in which the symplectic form is given 
by uj = dx l A dy % and C = {y = 0}, so there are no interesting local 
invariants of Lagrangian manifolds. 

A conic Lagrangian manifold in T*X is a Lagrangian submanifold 
of T*X\o that is invariant under the 1R + action on the fibers. (Here, o 
denotes the zero-section.) 

Among the most important examples of conic Lagrangians are the 
following: let Y C X be any submanifold; then N*Y C T*X is a conic 
Lagrangian. 

Exercise 8.2. Verify this. 

The trick to defining Lagrangian distributions is to figure out how 
to associate a phase function <f> with a conic Lagrangian C in T*X. 

Definition 8.4. A nondegenerate phase function is a smooth function 
4>(x,9), locally defined on a coordinate neighborhood of X x IR fc , such 
that is homogeneous of degree 1 in 9 and such that the differentials 
d(d(j)/d9j) are linearly independent on the set 



The phase function is said to locally parametrize the conic Lagrangian 



C 3 (x,9) h-» (x,d x tf) 
is a local diffeomorphism from C to C. 

Exercise 8.3. 

(1) Show that, in the notation of the definition above, C is au- 
tomatically a manifold, and the map C 3 (x, 9) \— > (x, d x <p) 
is automatically a local diffeomorphism from C to its image, 
which is a conic Lagrangian. 

(2) Show that if pj are definining functions for Y C X then 



is a nondegenerate parametrization of N*Y. 
(3) What Lagrangian is parametrized by the phase function used 
in our parametrix for the half-wave operator in the Euclidean 
case, given by 




C if 



= Yl Pi 6 1 



<P(t,x,y,9) = (x-y)-9-t\9\? 
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It turns out that every conic Lagrangian manifold has a local parametriza- 
tion; the trouble is, in fact, that it has lots of them. 

Definition 8.5. A Lagrangian distribution of order m with respect to 
the Lagrangian C as one that is given, locally near any point in X, by 
a finite sum of oscillatory integrals of the form 



and where is a nondegenerate phase function parametrizing C. Let 
I m (X, C) denote the space of all Lagrangian distributions on X with 
respect to C of order m. 

Note that the connection between k, the number of phase variables, 
and the geometry of £ is not obvious; indeed, it turns out that we have 
some choice in how many phase variables to use. As there are many 
different ways to parametrize a given conic Lagrangian manifold, one 
tricky aspect of the theory of Lagrangian distributions is necessarily the 
proof that using different parametrizations (possibly involving different 
numbers of phase variables) gives us the same class of distributions. 

The analogue of the iterated regularity property of conormal distri- 
butions, i.e. our ability to repeatedly differentiate along vector fields 
tangent to Y, turns out to be as follows: 

Proposition 8.6. Let u G I m (X, C). There exists s such that for any 
I G N and for any Ai, . . . , A t G ^(X) with ai(Aj)\c = 0, we have 



Of course, once this holds for one s, it holds for all smaller values; 
the precise range of possible values of s is related to the order m of 
the Lagrangian distribution; we will not pursue this relationship here, 
however. This iterated regularity property of Lagrangian distributions 
completely characterizes them if we use "Kohn-Nirenberg" symbols (as 
in Exercise 3.4) instead of "classical" ones (see [14]). 

8.3. Fourier integral operators. Fourier integral operators ("FIO's") 
quantize classical maps from a phase space to itself just as pseudodif- 
ferential operators quantize classical observables (i.e. functions on the 
phase space). The maps from phase space to itself that we may quan- 
tize in this manner are the symplectomorphisms, exactly the class of 
transformations of phase space that arise in classical mechanics. We 




where 




A l ...A l u G H S (X). 
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recall that a symplectomorphism between symplectic manifolds is a dif- 
feomorphism that preserves the symplectic form. We further define a 
homogeneous symplectomorphism from T*X to T*X to be one that is 
homogeneous in the fiber variables, i.e. commutes with the M + action 
on the fibers. 

An important class of homogeneous symplectomorphisms is those 
obtained as follows: 

Exercise 8.4. Show that the time-1 flowout of the Hamilton vector 
field of a homogeneous function of degree 1 on T*X is a homogeneous 
symplectomorphism. 

Given a homogeneous symplectomorphism $ : T*X — > T*X, con- 
sider its graph T$ C (T*X\o) x (T*X\o). Since $ is a symplectomor- 
phism, we have 

i ir L oo = l 7T R 00, 

where i is inclusion of T$ in (T*X\o) x (T*X\o), and 7r. are the left 
and right projections. If we alter T$ slightly, forming 

r$ = {(^1,6,^2,6) : (^1,6,^2,-6) e r $ }, 

and let t' denote the inclusion of this manifold, then we find that a sign 
is flipped, and 

{l')*t:* l u + {l'Y-KrU = 0; 

since Q = (vr^ + 7r|j,a;) is just the symplectic form on 

T*(X x X) =T*X x T*X, 

we thus find that r' $ is Lagrangian in T*(X x X). In fact, it is easily 
to verify that given a diffeomorphism $, T'^ is Lagrangian if and only 
if $ is a symplectomorphism. 

Exercise 8.5. Check this. 

Now we simply define the class of Fourier integral operators (of order 
m) associated with the symplectomorphism $ of X to be those opera- 
tors from smooth functions to distributions whose Schwartz kernels lie 
in the Lagrangian distributions 

r(ixi,r;). 

It would be nice if this class of operators turned out to have good prop- 
erties such as behaving well under composition, as pseudodifferential 
operators certainly do. We note right off the bat that these opera- 
tors include pseudodifferential operators, as well as a number of other, 
familiar examples: 

(1) m m (X) = I m (X x X,T' M ). 



MICROLOCAL ANALYSIS 



61 



(2) In M n , fix a and let Tf(x) = f(x — a) Then T has Schwartz 
kernel 

5(x — x' — a) 

which is clearly co normal of order zero at x — x' — a = 0. 
Note that this is certainly not a pseudodifferential operator, 
as it moves wavefront around; indeed, it is associated with the 
symplectomorphism = (x+a,£), and it it no coincidence 

that 

WFT/ = $(WF/). 

(3) As a generalization of the previous example, note that if : 
X — > X is a diffeomorphism, then we may set 

Tf(x) = /(0(x)); 

this is a FIO associated to the homogeneous symplectomor- 
phism 

<s>(x,o = (<p-\x)a;- Hx) (0) 

induced by <fi on T*X. 

Exercise 8.6. Work out this last example carefully. 

Now it turns out to be helpful to actually consider a broader class of 
FIO's than we have described so far. Instead of just using Lagrangian 
submanifolds of T*(X x X) given by V = where $ is a symplecto- 
morphism, we just require that T' be a reasonable Lagrangian (and we 
allow operators between different manifolds while we are at it): 

Definition 8.7. Let X,Y be two manifolds (not necessarily of the same 
dimension). A homogeneous canonical relation from T*Y to T*X is a 
homogeneous sub manifold T of (T*X\o) x (T*Y\o), closed in T*(X x 
Y)\o such that 

r' = {(x,Z,y,r ] ):(x,Z,y,-r ] )er} 

is Lagrangian in T*(X x Y). 

We can view Y as giving a multivalued generalization of a symplec- 
tomorphism, with 

T (V,V) = ■ (x,£,y,ri) G T}. 

and, more generally, if S C T*Y is conic, 

(8.4) F(S) = : there exists {y, V ) G S, with (x, £,</,??) G T}. 
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Definition 8.8. A Fourier integral operator of order m associated to a 
homogeneous canonical relation T is an operator from C£°(Y) to V'(X) 
with Schwartz kernel in 

I m (X x Y,V). 

Exercise 8.7. Show that a homogeneous canonical relation T is associ- 
ated to a symplectomorphism if and only if its projections onto both 
factors T*X and T*Y are diffeomorphisms. 

Exercise 8.8. 

(1) Let Y C X be a submanifold. Show that the operation of 
restriction of a smooth function on X to Y is an FIO. 

(2) Endow X with a metric, and consider the volume form dgy 
on Y arising from the restriction of this metric; show that 
the map taking a function / on Y to the distribution t— > 
f Y ( f ) \Y(y)f(y)dgY is an FIO. (Think of it as just multiplying 
/ by the delta-distribution along Y, which makes sense if we 
choose a metric.) What is the relationship between the restric- 
tion FIO and this one, which you might think of as an extension 
map? 

In the special case that T is a canonical relation that is locally the 
graph of a symplectomorphism, we say it is a local canonical graph. 

We now briefly enumerate the properties of the FIO calculus, some- 
what in parallel with our discussion of pseudodifferential operators. 
These theorems are considerably deeper, however. In preparation for 
our discussion of composition, suppose that 

Ti C T*X\o x T*Y\o, 

T 2 C T*Y\o x T*Z\o 

are homogeneous canonical relations. We say that Ti and T 2 are trans- 
verse if the manifolds 

ri x T 2 and T*X x A T * Y x T*Z 

intersect transversely in T*X x T*Y x T*Y x T*Z; here A t *y denotes 
the diagonal submanifold. 

Exercise 8.9. Show that if either Ti or T 2 is the graph of a symplecto- 
morphism, then Ti and T 2 are transverse. 

In what follows, we will as usual assume for simplicity that all mani- 
folds are compact. 40 In the following list of properties, some are special 



u In the absence of this assumption, we need as usual to add various hypotheses 
of properness. 
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to FIO's, that is to say, Lagrangian distributions on product manifolds, 
viewed as operators; others are more generally properties of Lagrangian 
distributions per se, hence their statements do not necessarily involve 
products of manifolds. In the interests of brevity, we focus on the 
deeper properties, and omit trivialities such as associativity of composi- 
tion. Note also that for brevity we will systematically confuse operators 
with their Schwartz kernels. 

(I) (Algebra property) USE I m (X x Y, r'J and T G I m ' (Y x Z, T' 2 ) 
and I\ and T 2 are transverse, then 

SoTe i m + m \x x z, (r\ o r 2 )'), 

where 

(8.5) r l0 r 2 = {(x,£,z,C) (x,£,y,ri) eF! 

and (y,r),zX) £ r2 for some (y,rj)}. 

Moreover, 

S* e I m (Y x X, (r -1 )') 

where T _1 is obtained from V by switching factors. 
(II) (Characterization of smoothing operators) The distributions in 
I~°°(X, C) are exactly those in C°°(X); composition of an oper- 
ator S E I m (X x Y, r") on either side with a smoothing operator 
(i.e. one with smooth Schwartz kernel) yields a smoothing op- 
erator. 

(Ill) (Principal symbol homomorphism) There is family of linear 
"principal symbol maps" 

(8.6) a m :nX,C)^ * +{dmiX)/ [ ' ■ 

Here L is a certain canonically defined line bundle on C (see the 
commentary below), and S™(jC;L) denotes L- valued symbols. 
We may identify the quotient space in (8.6) with 

C°°{S*C; L), 

and we call the resulting map a m instead. If S, T, are as in (I), 
with canonical relations r^I^ intersecting transversely, 

0m+m'(ST) = a m (S)a m >(T) 

and 



a m (A*) = s*a m (A), 
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where s is the map interchanging the two factors. The product 
of the symbols, at (x, £, z, Q G I\ o r 2 , is defined as 

<7m(S)(x,£,y,ri) ■ <r m >(T)(y,ri,z,C) 
evaluated at (the unique) (y,rj) such that (x,£,y,r)) G r l5 

(y,v,z,0 g r 2 . 

(IV) (Symbol exact sequence) There is a short exact sequence 

->• /"^(X, £) -> I rn {X, C) ^ C°°(S*£; L) -> 0. 

hence the symbol is if and only if an operator is of lower order. 
(V) Given £, there is a linear "quantization map" 

Op : S™ +(dimX)/ \£;L) J m (X,£) 
such that if 

oo 

E/ i\ I ^ im+(dim X)/4- j _ rfm.+ (dimX)/4/ ^ r \ 

then 

<7 m (Op(a)) = a m +(dimX)/4(^,0- 

The map Op is onto, modulo C°°(X). 
(VI) (Product with vanishing principal symbol) If P G Diff m (X) is 
self-adjoint and u G I m '(X,C), with £ C S P = {<r m (P) = 0}, 
then 

p M g / m+m '- 1 (x,/:) 

and 

v m+m >-i(Pu) = i _1 H p (<7 m /(u)), 
with Hp denoting the Hamilton vector field. 
(VII) (L 2 -boundedness, compactness) If T G I m (X x Y,T) is associ- 
ated to a local canonical graph, then 

T G £{H s {Y),H s - m {X)) for all s£i 

Negative-order operators of this type acting on L 2 (X) are thus 
compact. 

(VIII) (Asymptotic summation) Given Uj G I m ~ j (X,£), with j G N, 
there exists u G I m (X, C) such that 

j 

which means that 

N 

u-J2 u j£ I m ~ N -\X,£) 

3=0 



MICROLOCAL ANALYSIS 



65 



for each N. 

(IX) (Microsupport) The micro-support of T e I m (X x Y, V) is well 
defined as the largest conic subset T C T on which the symbol 
is 0(|f I" 00 ). We have 

WF Tu C f(WFu) 

for any distribution u on Y, where the action of T on WF u is 
given by (8.4). Furthermore, 

WF'(S o T) C WF' 5 o WF' T. 

Commentary: 

(I) This is a major result. Since FIO's include pseudodifferential 
operators, this includes the composition property for pseudo- 
differential operators as a special case. Another special case, 
when Z a point, yields the statement that an FIO applied to a 
Lagrangian distribution on the manifold Y with respect to the 
Lagrangian C C T*Y is a Lagrangian distribution associated to 
T(£), where T is the canonical relation of the FIO and T(C) is 
defined by (8.4). 

One remarkable corollary of this result is as follows: As will 
be discussed below, what our parametrix construction in §6 
really showed was that for t sufficiently small, and fixed, we 
have 

e -uVA ^ J°(X x X,Ct) 

where C t is the backwards geodesic flowout, for time t, in the 
left factor of N*A, of the conormal bundle to the diagonal in 
T*(X x X). 

Exercise* 8.10. Verify this assertion! (Try this now, but fear 
not: we will discuss this example further in §9 and you can try 
again then.) 

Now e~ lt ^ is a one-parameter group and so the composition 
property for FIO's allows us to conclude that in fact e~ %t ^ is 
an FIO for all times t, associated to the same flowout described 
above. The interesting subtlety is that while C t is an inward- or 
outward-pointing conormal bundle for small positive resp. neg- 
ative time (i.e. in the regime where our parametrix construction 
worked directly), for t exceeding the injectivity radius, it ceases 
to be a conormal bundle, while remaining a smooth Lagrangian 
manifold in T*(X x X). 
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(III) Modulo bundle factors, the principal symbol is defined as fol- 
lows: if u e I m (X, C) is given by 



then <J m (u) is defined by first restricting a(x, 6) to the manifold 



as is a nondegenerate phase function, this manifold is locally 
diffeomorphic (via a homogeneous diffeomorphism) to C, hence 
we may identify a\c with a function on C; transferring this 
function to C via the local diffeomorphism and taking the top- 
order homogeneous term in the asymptotic expansion gives the 
principal symbol. 

Much has been swept under the rug here — for a proper discus- 
sion, see, e.g., [10]. In particular, the line bundle L contains not 
just the density factors that we have been studiously ignoring— 
the Schwartz kernel of an operator from functions to functions 
on X is actually a "right-density" on X x X, i.e. a section 
of the pullback of the bundle |fi n (X)| in the right factor — but 
also the celebrated "Keller-Maslov index," which is related to 
the indeterminacy in choosing the phase function parametriz- 
ing the Lagrangian. We will not enter into a serious discussion 
of these issues here. We have also omitted discussion of the 
geometry of composing canonical relations, and the fact that 
transverse canonical relations compose to give a new canonical 
relation, with a unique point y, r] such that (x,£,y,r)) G I\, 
(y, rj, z, C) G T 2 whenever (x, £, z, C) G I\ o T 2 . 
(VI) There is a more general version of this statement valid for any 
P E ^ m (X) characteristic on £, but it involves the notion of 
subprincipal symbol, which requires some explanation; see [5, 
§5.2-5.3]. Moreover, if we are a little more honest about making 
this computation work invariantly, so that the symbol has a 
density factor in it (one factor in the line bundle L,) then we 
should really write 



where Cz denotes the Lie derivative along the vector field Z. 
(VII) This is fairly easy to prove, as if T of order m is associated to 
a symplectomorphism from Y to X, it is easy to check from 
the previous properties that T*T is an FIO associated with the 




C = {(x,e):d <j> = O}; 



MICROLOCAL ANALYSIS 



67 



canonical relation given by the identity map, and hence 
T*T G ^ 2m (Y), 

and we may invoke boundedness results for the pseudodiffer- 
ential calculus. In cases when T is not associated to a local 
canonical graph, this argument fails badly (i.e. interestingly), 
and the optimal mapping properties are a subject of ongoing 
research. 

Finally, as with the pseudodifferential calculus, we may define a no- 
tion of ellipticity for FIO's, and the above properties imply that (mi- 
crolocal) parametrices exist for the inverses of elliptic operators asso- 
ciated to symplectomorphisms. 

9. The wave trace, redux 

Let us briefly revisit our construction of the parametrix for the half- 
wave equation in the light of the FIO calculus. Here is what we did, in 
hindsight: we sought a distribution 

u G J m (K x X x X,C) 

for some Lagrangian C, and some order to, with 

u(0,x,y) = S(x - y) 

such that 

(D t + y/E x )u G r°°((-e, e) x X x X, C) = C°°((-e, e) x X x X). 
We begin by sorting out what to, the order of u, should be. Since 

u \ t=0 = 5(x -y) = (2n)- n [ e i{ - x ~ y > e d9, 

we were led us to a solution that for t small was of the form 

I a(t,x,y,9)e l ' s ' {t ^ y ' e) d9 

with a a symbol of order zero such that a(0,x,y,9) = 1, and $ a 
nondegenerate phase function such that $(0, x, y, 9) = (x — y) • 9. This 
was certainly the rough form of our earlier Ansatz; it should now be 
regarded as a Lagrangian distribution, of course. Since dim(M x X x 
X) = 2n + 1 and we have n phase variables $i, . . . , 9 n , the convention 
on orders of FIO's leads to m — —1/4. 

Now we address the following question: what Lagrangian C ought 
we to choose? Since 

n t:X G Diff 2 (M x X x X) C ^ 2 (M x X x X), 
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we a priori would have 

Ou G J 7/4 (M x X x X,C); 

as we would like smoothness of Du, we ought to start by making the 
principal symbol of Du vanish. The symbol of □ vanishes only on 

Sn = {r 2 = |£| 2 } 

hence the easiest way to ensure vanishing of the principal symbol is 
simply to arrange that 

(9.1) £CS D . 

Now, recall that our initial conditions were to be 

u(0,x,y) = S(x - y), 

where we may view this as a Lagrangian distribution on X x X with 
respect to N*A, the conormal to the diagonal: 

N*A = {(x, y, £, 7]) : x = y, £ = -rj}. 

It is not difficult to check that the requirement that u\t=o gives this 
lower-dimensional Lagrangian 41 together with the requirement (9.1) 
that C should lie in the characteristic set implies that C fl {t = 0} 
should just consist of points in £ n projecting to points in N*A, i.e. 
that we should in fact have 

C n {t = 0} = {(t = 0,r = -\v\ g ,x = y,£= -r,)} C T*(R x X x X). 

Here we have chosen the sign r = in view of our real interest, 

which is in solving 

(A + ^/a)u = o 

rather than the full wave equation; 42 we have thus kept C inside the 
characteristic set of A + V~A, which is one of the two components of 

Let Co n ow denote C fl {t = 0}. The set Co is a manifold on which 
the symplectic form vanishes (an "isotropic" manifold), of dimension 
one less than half the dimension ofT*(lxIx X). (Exercise: Check 

41 We really ought to think a bit about restriction of Lagrangian distributions 
here: this is best done by regarding the restriction operator itself as an FIO (cf. 
Exercise 8.8). We shall omit further discussion of this point, but remark that 
it should at least seem plausible that the Lagrangian manifold associated to the 
restriction is the projection (i.e. pullback under inclusion), of the Lagrangian in the 
ambient space — cf. Exercise 4.11. 

42 We have chosen to emphasize this distinction only at this critical juncture 
only because as it is in some respects more pleasant to deal with □ than with the 
half-wave operator when possible. 
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this! Most of the work is done already, as iV*(A) is Lagrangian in 
T*(X x X).) 

We now proceed as follows to find a Lagrangian (necessarily one di- 
mensional larger) containing Cq: let H = Hn denote the Hamilton vec- 
tor field of the symbol of the wave operator, in the variables (t, x,t,£). 
(I.e., take the Hamilton vector field of 0(t,x) on the cotangent bundle 
of R x X x X — nothing interesting happens in y,i].) By construction, 
Co C S n ; we now define C to be the union of integral curves of H 
passing through points in C . More concretely, these are all backwards 
unit-speed parametrized geodesies beginning at (x — y, £ — —77), where 
(x, £) evolves along the geodesic flow, and (y, 77) are fixed. (Meanwhile, 
t is evolving at unit speed, and r is constrained by the requirement 
that we are in the characteristic set so that r = — The manifold 
C stays inside S n (indeed, inside the component that is S Dt+v ^) since 
H is tangent to this manifold; moreover, C is automatically Lagrangian 
since u vanishes on C and cr 2 (n) does as well, so that for Y G TC , 
we further have 

u(Y, H) = (d(a 2 (D)), Y) = Y<r 2 (D) = 0. 

This gives vanishing of u> on the tangent space to C at points along 
t — 0; to conclude it more generally, just recall that the flow generated 
by a Hamilton vector field is a family of symplectomorphisms. 

Exercise 9.1. Check that C is in fact the only connected conic La- 
grangian manifold passing through £ an d lying in S n . (Hint: Ob- 
serve that H is in fact the unique vector at each point along £ that 
has the property u(Y, H) = for all Y e TC .) 

Thus, to recapitulate, if we obtain £ by flowing out C (the lift of the 
conormal bundle of the diagonal to the characteristic set of D t + a/A) 
along H, the Hamilton vector field of □, we produce a Lagrangian on 
which □ is characteristic. 

Exercise 9.2. Show that the phase function <f)(t,x,r]) — y ■ r] that we 
constructed explicitly in §6 does indeed parametrize 

C = {(t, t, x, e, y, -77) : r = -\£\ g , (x, f) = $ t (y, 77) 

(with $j denoting geodesic flow, i.e. the flow generated by the Hamilton 
vector field of |£| ) over \t\ 1. 

Compare our solution to the eikonal equation using Hamilton- Jacobi 
theory in Exercise 6.1 to what we have done here. 

We now remark that while our parametrization of the Lagrangian in 
§6 worked only for small t, the definition given here of £ C T*(Ix 
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X x X) makes sense globally in t, not merely for short time. When t 
is small and positive and y fixed, the projection of C to is just 

the inward-pointing conormal bundle to an expanding geodesic sphere 
centered at y; when t exceeds the injectivity radius of X, C ceases to 
be a conormal bundle, but remains a well-behaved smooth Lagrangian. 

Let us now return from our lengthy digression on the construction of 
C to recall what it gets us. Solving the eikonal equation, i.e. choosing 
C, has reduced our error term by one order, and we have achieved 

Ou G J 3/4 (M x X x X,£); 

to proceed further, we invoke Property (VI) of FIO's, to compute 

a 3/4 (Du) = i -1 H<7_i/ 4 (u); 

setting this equal to zero yields our first transport equation, and it is 
solved by simply insisting that (T_ 1 / 4 (-u) be constant along the flow, 
hence equal to 1, its value at t — (which was dictated by our 5- 
function initial data). 

Now we have achieved Ou = r_i/4 G J -1 / 4 Adding an element w_5/4 
of J~ 5 / 4 (IR x X x X, C) to solve this error away and again applying 
(VI) yields the transport equation 

i _1 H((7_ 5 /4(w_5/4)) = -<7_i/ 4 (r_i/ 4 ), 

which we may solve as before. Continuing in this manner and asymp- 
totically summing the resulting terms, we have our parametrix u G 
/-V 4 (M x X x X, C). 

Now we describe, very roughly, how to use the FIO calculus to com- 
pute the singularities of Tr U (t) at lengths of closed geodesies. 
Let T denote the operator C°°(M x X x X) ->■ C°°(R) given by 43 

T : f(t,x,y) H> / f(t,x,x)dx. 
Jx 

Thus, TrU = T(U), and we seek to identify this composition as a 
Lagrangian distribution on R 1 ; such a distribution is thus conormal to 
some set of points; as we saw above (and will see again below) these 
points may only be the lengths of closed geodesies, together with 0. 



d It is here that our omission of density factors becomes most serious: T should 
really act on densities defined along the diagonal, so that the integral over X is 
well-defined. Fortunately, U itself should be a rig/i£-density (i.e. a section of the 
density bundle lifted from the right factor); restricted to the diagonal, this yields a 
density of the desired type. 
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The Schwartz kernel of T is the distribution 



5(t - t')5(x - y) 



on R x R x X x X; it is thus conormal to t — t',x — y, i.e. is a 
Lagrangian distribution with respect to the Lagrangian 



Noting that if we reshuffle the factors into (R x X) x (R x X), the 
distribution 5(t—t')5(x—y) becomes the kernel of the identity operator, 
we can easily see that the order of this Lagrangian distribution is 0. 
Thus, 

T G I°(R x R x X x X,V) 
where the relation r : T*(R x X x X) T*R maps as follows: 



Let C be the Lagrangian for our parametrix u constructed above. 
If an interval about L e R contains no lengths of closed geodesies, 
then we see that no points in C lie over {(x, £) = (y, — 77)} for t near 
L, hence has no points over this interval, i.e. the composition 

Tu is smooth in this interval. This gives another proof of the Poisson 
relation, Theorem 5.3. 

If, by contrast, there is a closed geodesic of length L, then 



Note that in effect we get a contribution from every (x,£) lying along 
the geodesic, and that in particular, the fiber over (L, r) of the projec- 
tion on the left factor 



(giving the composition T(£)) consists of at least a whole geodesic of 
length L, rather than a single point. Thus, the composition of these 
canonical relations is not transverse and the machinery described thus 
far does not apply. In [3], Duistermaat-Guillemin remedied this defi- 
ciency by constructing a theory of composition of FIO's with canonical 
relations intersecting cleanly. 

Definition 9.1. Two manifolds X, Y intersect cleanly if X H Y is a 
manifold with T(X D Y) — TX n TY at points of intersection. 

For instance, pairs of coordinate axes intersect cleanly but not trans- 
versely in R n . In general, in the notation of Property (I), if the intersec- 
tion of the product of canonical relations T 1 x T 2 with the partial diago- 
nal T*X x A x T*Z is clean, we define the excess, e, to be the dimension 



{t = t',x = y,r 



-t',£ = -v} 




{(L,r):r<0}er(£). 



(T*R x A T *^ xX xx)xT*(RxXxx)) H (r x C) — > T*R 
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of the fiber of the projection from this intersection to T*X x T*Z; this 
is zero in the case of transversality. Duistermaat-Guillemin show: 



i.e. composition goes as before, but with a change in order. In addition 
the symbol of the product is obtained by integrating the product of the 
symbols over the e-dimensional fiber of the projection in what turns 
out to be an invariant way. 

Let us now assume that there are finitely many closed geodesies 
of length L, and that they are nondegenerate in the following sense. 
For each closed bicharacteristic (i.e. lift to S*X of a closed geodesic) 
7 C S*X, pick a point p G 7 and let Z C S*X be a small patch of a 
hypersurface through p transverse to 7. Shrinking Z as necessary, we 
can consider the map P 7 : Z — > Z taking a point to its first intersection 
with Z under the bicharacteristic flow on S*X. This is called a Poincare 
map. Since -P 7 (p) = p, we can consider dP 1 : T P Z — > T P Z. We say that 
the closed geodesic is nondegenerate if Id — dP 1 is invertible. Note that 
this condition is independent of our choices of p and Z, as are the 
eigenvalues of Id — <iP 7 . 

The following is due to Duistermaat-Guillemin [3]: 

Theorem 9.2. Assume that all closed geodesies of length L on X are 
nondegenerate. Then 



where P 1 is the Poincare map corresponding to the geodesic 7, and a 1 
is the number of conjugate points along the geodesic. 

A proof of this theorem requires understanding the symbol of the 
clean composition Tu (where u is our parametrix for the half-wave 
equation). This lies beyond the scope of these notes. We merely note 
that we are in the setting of clean composition with excess 1, hence 
locally near t = L, 



Sot e / m + m '+ e / 2 (x x z, (r\ o r 2 )') 




7 of length L 



Tue I°~ 1/4+1/2 (R, {t = L,r<0}). 
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This Lagrangian is easily seen to be parametrized, locally near t = L, 
by the phase function with one fiber variable 44 

hence we may write 

POO 

Tu = (2tt)- 3 / 4 / a(t, 9)e~^- L ^ e d9, 
Jo 

where o 6 x I) has an asymptotic expansion a ~ ao + \9\ 1 a-i + 
.... Our task is to find the leading-order behavior of Tu, and this is of 
course dictated by its principal symbol. To top order, a is given by the 
constant function ao(L, 1), hence Tu is (to leading order) a universal 
constant times a (L, 1) times the Fourier transform of the Heaviside 
function, evaluated at t — L. Thus, the limit in the statement of the 
theorem is, up to a constant factor, just the value of a (L, 1). The 
whole problem, then, is to compute the principal symbol of this clean 
composition, and we refer the interested reader to [3] for the (rather 
tricky) computation. 45 



10. A GLOBAL CALCULUS OF PSEUDODIFFERENTIAL OPERATORS 

10.1. The scattering calculus on IR n . We now return to some of 
the problems discussed in §2, involving operators on noncompact man- 
ifolds. Recall that the Morawetz estimate on IR n , for instance, hinged 
upon a global commutator argument, involving the commutator of the 
Laplacian with (l/2)(D r +.D*) on R™. Generalizing this estimate to non- 
compact manifolds will require some understanding of differential and 
pseudodifferential operators that is uniform near infinity. Recall that 
thus far, we have focused on the calculus of pseudodifferential operators 
on compact manifolds; in discussing operators on W\ we have avoided 
as far as possible any discussion of asymptotic behavior at spatial infin- 
ity. Thus, our next step is to discuss a calculus of operators — initially 
just on IR n — that involves sensible bounds near infinity. 

Thus, let us consider pseudodifferential symbols defined on all of 
T*M. n with no restrictions on the support in the base variables, with 



This phase function should of course be modified to make it smooth across 
8 = 0, but making this modification will only add a term in C°°(IR) to the Lagrangian 
distribution we write down. 

45 We note that the factor i a -< is the contribution of the (in)famous Keller-Maslov 
index, and is in many ways the subtlest part of the answer. 



74 



JARED WUNSCH 



asymptotic expansions in both the base and fiber variables, both sepa- 
rately and jointly. To this end, note that changing to variables 

, and £ amounts to compactifying the base and fiber variables of 
T*M ra radially, to make the space B™ x B^, with B n denoting the closed 
unit ball. (Recall that we defined a radial compactification map in 
(3.4), and that while and are what we should really use as 

defining functions for the spheres at infinity, and are accept- 
able substitutes as long as we stay away from the origin in the corre- 
sponding variables.) The space B n x B n is a manifold with codimension- 
two corners, i.e. a manifold locally modelled on [0, 1) x [0, 1) x M 2n ~ 2 ; 
its boundary is the union of the two smooth hypersurfaces S^ 1 x B^ 
and B™ x S^ -1 . In our local coordinates, and |£| _1 are the defining 
functions for the two boundary hypersurfaces, i.e. the variables locally 
in [0, 1), while a choice of n — 1 of each of the x and £ variables gives 
the remaining M n ~ 2 . 



B n x gn-1 S n-1 x g: 

I*. 



S 11 ' 1 x B 



Figure 2. The manifold with corners B n x B n in the 
case n — 1. At the top (and bottom) are the boundary 
faces from B n x S'^ 1 arising from the compactification 
of the second factor — this is "fiber infinity." At left (and 
right) are the faces from S^ 1 x B n , arising from com- 
pactification of the first factor — this is "spatial infinity." 
The corner(s) at which these faces meet is S 71 ^ 1 x S'^ 1 . 
The functions p = \x\ _1 and a = l^p 1 can be locally 
taken as defining functions for the spatial infinity resp. 
fiber infinity boundary faces. The disconnectedness of 
B n x S^ 1 and S'^ 1 x B n is of course a feature unique 
to dimension one. 
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We now let 46 

S™' l (T*R n ) 

denote the space of a e C°°(T*R n ) such that 47 
(10.1) (0~ m fa)~ l a e C°°(B n x B n ). 

This condition gives asymptotic expansions (i.e., Taylor series) in var- 
ious regimes: 
(10.2) 

a(x,0 ~ kl^X-^'O. as a; -> oo, ( G V d M" = (B n )° 

afo ~ XI l a; | , ~*l^r~ Ja ij( £ > 0> asa; .C^ oo. 
Finally, let 

denote the space consisting of the (left) quantizations of these symbols. 
The "sc" stands for "scattering." 48 

This is an algebra of pseudodifferential operators, containing all or- 
dinary pseudodifferential operators on M, n with compactly supported 
Schwartz kernels. The algebra of scattering pseudodifferential opera- 
tors enjoys all the good properties of our usual algebra, plus some more 
that derive from its good behavior at infinity. We can compose oper- 
ators to get new operators, and if A e V™' l (R n ), B e ^> l '(R n ), we 
have AB e \&™+ m ' (W 1 ) . Likewise, adjoints preserve orders. What is 
novel here, however, is the principal symbol map. 

As the symbols defined by (10.1) are those that, up to overall factors, 
are smooth functions on B n x B n , we can define the principal symbol 
of order m, I of the operator Op(a) as 

this can be further split into pieces corresponding to the restrictions to 
the two boundary hyp ersurf aces: 



This space should really be called S^' sc , with the cl once again indicating 
"classicality" (as opposed to Kohn-Nirenberg type of estimates alone). We omit 
the cl so as not to clutter up the notation. 

47 We are abusing notation here by ignoring the diffeomorphism of radial com- 
pactification, thus identifying C°°(B n x B n ) directly with a space of functions on 
E™ x E™. 

48 This is a space of operators considered by many authors; as we are following 
roughly the treatment of Melrose [18], we have adopted his notation for the space. 
Note, however, that we have reversed the sign from his convention for the order I. 
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where 

is nothing but the ordinary principal symbol, rescaled by a power of 
(x), and 

^(iKx.Oer^xB") 

is the novel piece of the symbol, measuring the behavior of the operator 
at spatial infinity. Note that these two pieces of the principal symbol 
are not independent: they must agree at the corner, S" 1-1 x S^ 1 . We 
may also choose to think of the principal symbol as 

a m,l (A) e S'™' z (T*]R n )/S'™~ 1 '^ 1 (T*]R n ), 

and we will often confuse the symbol with its equivalence class; this 
is usually less confusing than keeping track of the rescaling factor 
(0 m (x) 1 . 

The principal symbol short exact sequence thus reads: 

-> ^-^-'(R") -> ^(M n ) ^ C°°(d(B n x B n )) ->■ 0. 

Thus, vanishing of this symbol yields improvement in both orders at 
once; correspondingly, vanishing of one part of the symbol gives im- 
provement in just one order: 

->• ^-^(R") ->■ ^ m ''(M n ) % l C°°(B n x S 71 - 1 ) -> 0, 

->■ ^•'- 1 (M n ) ->• ^ m ''(M n ) ^ C 00 ( 1 S"- 1 x B n ) 0. 

The symbol of the product of two scattering operators is indeed the 
product of the symbols, 49 as (equivalence classes of) smooth functions 
on d{B n x B n ). 

The symbol of the commutator of two scattering operators (which 
is of lower order than the product in both nitrations) is, as one might 
suspect, given by % times the Poisson bracket of the symbols. 

The residual calculus is particularly nice in this setting: instead of 
merely consisting of smoothing operators, it consists of operators that 
are "Schwartzing" — they create decay as well as smoothness: 

R e tf" 00 -°°(R n ) <^> R : <S'(R n ) ->■ S(R n ). 

One problem with using the ordinary calculus for global matters is 
that we can only conclude compactness of operators of negative order 

49 It is exactly this innocuous statement, which the reader might think routine, 
that separates the scattering calculus from many other choices of pseudodiffcrcn- 
tial calculus on noncompact manifolds: typically the "symbol at infinity" (here 
will compose under operator composition in a more complex, noncom- 
mutative way. 
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for compactly supported operators. Here, we have a much more precise 
result: 

Proposition 10.1. An operator in \I/^ (lR n ) is bounded on L 2 (IR n ); an 
operator of order (to, I) with m,l < is compact on L 2 (IR n ). 

Associated to the expanded notion of symbol, there is are associated 
notions of ellipticity (nonvanishing of the principal symbol) and of WF' 
(lack of infinite order vanishing of the total symbol). We have an 
associated family of Sobolev spaces: 

u G H™' l (R n ) •<=>- \/A G ^™''(M n ), Au G L 2 iW). 

Operators in the calculus act on this scale of Sobolev spaces in the 
obvious way. Since smoothing operators are "Schwartzing," it is not 
hard to see that 

(We will return to an explicit description of these Sobolev spaces shortly.) 
There is also an associated wavefront set: 

WF sc m C d(B n x B n ) 

is defined by 

p <£ WF sc m there exists A G elliptic at p, with Au G S. 

In (5™)° x S^ 1 C d(B n x B n ), (i.e., in the usual cotangent bundle of 
M ra ) this definition just coincides with ordinary wavefront set; but "at 
infinity," i.e. in S*™ -1 x B^, it measures something new. To see what, 
let us consider some examples. 

Example 10.2. 

(1) Constant coefficient vector fields on W 1 : If v G R n and P = 
i~ 1 v ■ V, then, we can write 

the principal symbol is thus 

(2) Likewise, the symbol of the Euclidean Laplacian A is cr 2j o(A) = 
|£| 2 . Note that the Laplacian is not elliptic in the scattering cal- 
culus, as its principal symbol vanishes at £ = on the bound- 
ary face S*™ -1 x . This should come as no suprise, as A has 
nullspace in iS'(!R n ) (given by harmonic polynomials) that does 
not lie in L 2 , hence is not consistent with elliptic regularity in 
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the scattering calculus sense: if Q is elliptic in the scattering 
calculus, 

Qu G S(R n ) ==>- u G «S(R n ). 

On the other hand, consider Id+A. We have Id G ^^(R"), 
hence adding it certainly does not alter the "ordinary" part of 
the symbol, living on (B n )° x S 71 ^ 1 . But it does affect the symbol 
in S 11 ^ 1 x B n : we have 

a 2)0 (Id+A) = l + |e| 2 ; 

Id + A is an elliptic operator in the scattering calculus, and of 
course it is the case that (Id+A)w G «S(R n ) implies that u is 
likewise Schwartz. 

(3) If we vary the metric from the Euclidean metric to some other 
metric g, we may or may not obtain a scattering differential 
operator; for example, if g were periodic, we certainly would 
not, as the total symbol of A would clearly lack an asymptotic 
expansion as \x\ —> oo. Suppose, however, that we may write in 
spherical coordinates on M. n 

g = dr 2 + r 2 J2 h ij (r-\6)d6 i de j for r > R > 0. 
where hij is a smooth function of its arguments, and 

h ij (o,e)de i de j 

is the standard metric on the "sphere at infinity." We will call 
such a metric asymptotically Euclidean. Then the corresponding 
Laplace operator is in the scattering calculus. 

Exercise 10.1. Check that this operator does lie in the scattering 
calculus. 

Let A denote the Laplacian with respect to an asymptotically 
Euclidean metric. Then 

(Id+A)- 1 g * s " c 2 '°(R n ). 

(4) (x) 2 (Id+A) G ^ 2 (M n ) and has symbol (x) 2 (l + |£| 2 ). This is 
globally elliptic. 

By the last example, we find that 

u G H 2 f(R n ) ^ (x) 2 (ld+A)u G L 2 (R n ); 

interpolation and duality arguments allow us to conclude more gener- 
ally that the scattering Sobolev spaces coincide with the usual weighted 
Sobolev spaces: 

H™' l (R n ) = ( X y l H m (M. n ). 
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We now turn to some examples illustrating the scattering wavefront 
set. Consider the plane wave 

u(x) = e iax . 

We have 

(D x j — atj)u — for all j — 1, . . . , n. 

The symbol of the operator D x j — otj is £j — aij, hence the intersection of 
the characteristic sets of these operators is just the points in S n ~ 1 x B n 
where £ = a. As a consequence, we have 

WF sc (e ia - x ) C {{x,0 G S'"- 1 x W 1 : £ = a} 

(here we are as usual identifying (B n )° = M ra ). In fact this containment 
turns out to be equality, as we see by the following characterization of 
scattering wavefront set. 

Proposition 10.3. Let p = (x ,fo) e S'^ 1 x Rn - We 

p i WF sc m 

i/ and only if there exist cutoff functions <fi G C^°(M. n ) nonzero at £ and 
7 G C°°(lR n ) nonzero in a conic neighborhood of the direction x such 
that 

(pFiju) g S(R n ). 

This is of course closely analogous to the characterization of ordinary 
wavefront set in Proposition 4.5, and is proved in an analogous manner. 
Note that if u is a Schwartz function in a set of the form 

{ 1 1 — r - x \ < e, \x\ > R } 

for any e > 0, R ^> 0, then there is no scattering wavefront set at 
points of the form (x ,0 for any £ G R n . Thus, this new piece of 
the wavefront set measures the asymptotics of u in different directions 
toward spatial infinity: xq provides the direction, while the value of £o 
records oscillatory behavior of a specific frequency. 

There is also, of course, a similar characterization of WF SC u inside 
S*" -1 x S™ -1 . We leave this as an exercise for the reader. 

10.2. Applications of the scattering calculus. As an example of 
how we might use the scattering calculus to obtain global results on 
manifolds, let us return to the local smoothing estimate from §2.1. Re- 
call that if ip satisfies the Schrodinger equation (2.1) on R n with initial 
data G H 1 / 2 , this estimate (or, at least, one version of it) tells us 
that 



(10.3) 
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hence the solution is (locally) half a derivative smoother than the data, 
on average. How might we obtain this estimate on a manifold, with 
A replaced by the Laplace-Beltrami operator (which we also denote 
A)? For a start, note that (10.3) fails badly on compact manifolds; in 
particular, recall that since [A, A s ] = for all s G R, the H s norms are 
conserved under the evolution, hence if tp ^ H s , with s > 1/2, then 
we certainly do not have 50 tp G Lf oc (M. t ; H s ). So if we seek a broader 
geometric context for this estimate, we had better try noncompact 
manifolds. 

Recall that we initially obtained the estimate by a commutator ar- 
gument with the Morawetz commutant 



which actually gave more information; we noted that we could, instead, 
have used a simpler commutant f(r)D r , with f(r) = near r = 0, 
nondecreasing, and equal to 1 for r > 2 (say): this gives a commutator 
with a term 

x{r)Dl 

which, when paired with ip and integrated in time, tests for H 1 regu- 
larity in an annular neighborhood of the origin (which could have been 
translated to be anywhere); other terms in the commutator are posi- 
tive also, modulo estimable error terms, and we thus obtain the local 
smoothing estimate. Generalizing this is tricky, as the positivity of the 
symbol of the term 

i[A,D r ] 

on R™ is delicate: the symbol of this commutator is given by the Poisson 
bracket 

m 2 ^-£} = 2^d x ^-x) = ^\ 2 -(^x) 2 ) 

which is nonnegative but does actually vanish at £ || x, i.e. in radial 
directions. If we perturb the Euclidean metric a bit, and replace |£| 2 
with |£| 2 , the symbol of the Laplace-Beltrami operator, but leave the 
inner product (£, x) = then this computation fails to give pos- 

itivity. So we have to be more careful. We might try to adapt Yl 
to the new metric instead, but this is problematic, as it doesn't really 

50 Note that this argument fails on R™ exactly because of the distinction between 
local and global Sobolev regularity: there is nothing preventing a solution on R™ 
with initial data in H 1 / 2 from being locally H 1 — or even smooth on arbitrarily 
large compact sets — in return for having nasty behavior near infinity. 
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make much invariant sense. Moreover, it seems even more problem- 
atic upon interpretation: what positivity of a} means is just that 
a is increasing along the bicharacteristic flow of i.e. is increasing 
along (the lifts to the cosphere bundle of) geodesies. This is clearly 
impossible if there are any closed (i.e., periodic) geodesies, or indeed 
if there are geodesies that remain in a compact set for all time, hence 
our difficulty in obtaining an estimate on compact manifolds. 

Exercise 10.2. Suppose that a geodesic 7 remains in a compact subset 
of M. n (equipped with a non-Euclidean metric) for all t > 0. Let p = 
(7(0)5 (t'(0))*) £ T*R n (with * denoting dual under the metric). Show 
that there cannot exist a smooth a E C°°(T*R n ) with {\£\ 2 g , a} > e > 
and a(p) 7^ 0. 

Definition 10.4. Let g be an asymptotically Euclidean metric on R n , 
and let 7 be a geodesic. We say that 7 is not trapped forward/backward 
if 

lim \j(t)\ = 00. 

t— >±oo 

We say that 7 is trapped if it is trapped both forward and back- 
ward. We also use the same notation for the bicharacteristic pro- 
jecting to 7. Moreover, we say that a point in S*M. n along a non- 
(forward/backward)-trapped geodesic is itself non-(forward/backward)- 
trapped. 

It is a theorem of Doi [4] that the local smoothing estimate (10.3) 
cannot hold near a trapped geodesic. (The total failure of (10.3) on 
compact manifolds should make this plausible, but it turns out to be 
considerably more delicate to show that it fails even if the only trapping 
is, for instance, a single, highly unstable, closed geodesic.) As a result 
we will require some strong geometric hypotheses in in order to find a 
general context in which (10.3) holds. 

The following is a result of Craig-Kappeler-Strauss [1]: 

Theorem 10.5. Consider tp a solution to the Schrodinger equation 
on asymptotically Euclidean space, with ^0 e # 1/2 (M n ). The estimate 
(10.3) holds microlocally at any (xq, £q) that lies on a nontrapped bichar- 
acteristic, i.e. for any A G ^(R™) compactly supported and microsup- 
ported sufficiently near to (xo,£o), we have for any T > 0, 51 

T ||^|| 2 rft<||^o||?, 1 / 2 - 



51 More generally, we can replace the Sobolev exponents 1/2 and 1 by s and 
s + 1/2 respectively; in particular, L 2 initial data gives an L 2 H 1 / 2 estimate. 
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Proof. We will prove the theorem by using a commutator argument in 
the scattering calculus. To begin, we recall from Exercise 4.21 that the 
set along which microlocal Lif oc H l regularity holds is invariant under 
the geodesic flow. Hence it suffices just to obtain regularity of this form 
somewhere along the geodesic 7. The convenient place to do this is out 
near infinity. 

In order to make a commutator argument, note that it is very useful 
to have a quantity that behaves monotonically along the flow. We refer 
to points in T*M n near infinity (i.e. for \x\ ^> 0) as incoming if £ -x < 
and outgoing if £ • x > (this corresponds to moving toward or away 
from the origin, respectively, under asymptotically Euclidean geodesic 
flow). Heuristically, under the classical evolution, points move from 
being incoming to being outgoing. More precisely, we observe that the 
Hamilton vector field of p = 02,0 (A) is given by 

h p = - £ t& dJ ^ d ^ + 2 E ^ (x)d xj . 

Recalling that g 1 ^ has an asymptotic expansion with leading term given 
by the identity metric, we can write this as 

(10.4) H p = 2i ■ d x + 0{\x\~ l \i\)d x + 0{\x\- l \i\ 2 )d^ 

(where in fact the whole vector field is homogeneous of degree 1 in £). 

Exercise 10.3. Verify (10.4). 

Thus, 

h p (£-x) = ||(i-(£-£) 2 )+o(|£|N- 1 ). 

This is thus positive, as long as £-x is away from ±1, and \x\ is large, 52 
i.e., as long as we stay away from precisely incoming or outgoing points. 
Thus, we manufacture a scattering symbol for a commutant that has 
increase owing to the increase in "outgoingness:" Let x( s ) denote a 
smooth function that equals for s < 1/4 and 1 for s > 1/2, with x' 
a square of a smooth function, nonzero in the interior of its support. 
Let Xs(s) = x(.$ s )- We choose 

a(x,Z) = \Z\ g x(-£'£)xt(\x\)x(\Z\ a )- 

Thus a is supported at incoming points at which \x\ > 1/(45) ^> 0; 
the first x factor localizes near incoming points, and the factor of xs 
keeps \x\ large. (The factor simply cuts off near the origin in 

£ to yield a smooth symbol.) Under the flow on the support of a, x 



Largeness of £ plays no role because of homogeneity of the Hamilton vector 
field of the principal symbol of A. 
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tends to decrease and we become more outgoing, so the tendency is the 
leave the support of a along the flow. This is the essential point in the 
following: 

Exercise 10.4. Check that a G S^°(T*R n ) and that if 5 is chosen suffi- 
ciently small, we may write 

H p a = -b 2 - c 2 

where 

(1) b G Sh~ 1/2 {T*W l ) is supported in suppx'(-| • x)xs(\x\) 

(2) c G S , sc _1/2 (T*lR n ) is supported in supp%(-| ' x )x'si\ x \) an d 
nonzero on the interior of that set. 

(Note that |£| is annihilated by H p , so the terms containing |£| simply 
do not contribute.) 

Now let A G ^^(R™) have principal symbol a. Then we have 
i[A, A] = —B*B - C*C + R 

with B = Op(6), C = Op(c) G *^" 1/2 (R n ), and R G *^" 2 (R n ). 
Hence, 



\\Ci,\\ 2 dt< (AtM 



T 

(R^,^)dt 



As (Aip, ip) is bounded by the L°°H 1 / 2 norm of ip and hence by H^oIIh 1 / 2 ' 
and the R term likewise, 53 we obtain 

(10-5) [ T \\CiP\\ 2 dt<Uo\\ 2 Hl/2 . 

Jo 

Exercise* 10.5. Show that for any R > 0, there exists 5 > sufficiently 
small that if (xo,£o) e T*R™ fl < Rq} lies along a non-backward 
trapped bicharacteristic, some point on that bicharacteristic with t <^0 
lies in ellC, with C = Op(c) constructed as above. 

Thus, rays starting close to the origin that pass through \x\ ~ 5~ l for 
t are incoming when they do so. This is an exercise in ODE. You 
might begin by showing that if a backward bicharacteristic starting in 
{\x\ < R } passes through the hypersurface \x\ = Rl with Rl ^> 0, 



53 In fact, the R term is considerably better than necessary for this step, as it has 
weight —2 rather than just (which would be all we need to obtain the estimate). 
The astute reader may thus recognize that we are far from using the full power of 
the scattering calculus here. A proof of the global estimate in Exercise 10.6 requires 
a more serious use of the symbol calculus, however, as do the estimates which are 
the focus of [1] , which show that microlocal decay of the initial data yields higher 
regularity of the solution along bicharacteristics. 
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then it must have £ • x < there, and that £ • x will keep decreasing 
thereafter along the backward flow. 

Given a non-backward-trapped point q G S*M. n , Exercise 10.5 tells 
us that we may construct a commutant A as above so that the commu- 
tator term C is elliptic somewhere along the bicharacteristic through q. 
Equation 10.5 tells us that we have the desired L 2 H l estimate on ellC, 
and the flow-invariance from Exercise 4.21 yields the same conclusion 
at q. Thus, we have proved the desired result at non-backward-trapped 
points. It remains to consider non-forward-trapped points. 

Suppose, then, that q = (xo,£,o) £ T*M n is non-forward-trapped; 
then note that q' = (xo, — £o) is non-backward-trapped. Consider then 
the function if) : if 

(A + A)ip = 

then 

(-A + A)ip = 0, 

i.e. 

ip(t,x) = tp(T-t,x) 
again solves the Schrodinger equation. Of course, by unitarity, 

||^(0,^)||^i/ 2 = H^olljyi/a- 

Since q' is non-backward trapped, we thus find that there exists C G 
^ S 1 r 1/2 (M™), elliptic at q', with 









r 

/o 




2 dt< ||^(0, x)|| 



on the other hand, 



\\Crl>(T-t r ) 
\\C1>(T-t, 



where 



C = Op,(c(x,0), and C = Op t (c(x, -£)); 

thus, C tests for regularity at g, and we have obtained the desired 
estimate at q. □ 



Corollary 10.6. On an asymptotically Euclidean space with no trapped 
geodesies, the local smoothing estimate holds everywhere. 
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Exercise* 10.6. (Global (weighted) smoothing.) Show that if there are 
no trapped geodesies, and tpo G L 2 , we have 



for every e > 0. (This is a bit involved; a solution can be found, e.g., 
in Appendix II of [8].) 

10.3. The scattering calculus on manifolds. We can generalize the 
description of the scattering calculus to manifolds quite easily, following 
the prescription of Melrose [18]. Let X be a compact manifold with 
boundary. We will, in practice, think of the interior, X°, as a non- 
compact manifold (with a complete metric) that just happens to come 
pre-equipped with a compactification to X. Our motivating example 
will be X = B n , where X° is then diffeomorphically identified with W l 
via the radial compactification map. Recall that on M. n , radially com- 
pactified to the ball, we used coordinates near S^ -1 , the "boundary at 
infinity," given by 

1 x 

p — n> 9 = FT' 

\x\ \x\ 

where in fact p together with an appropriate choice of n — 1 of the 0's 
furnish local coordinates near a point. In these coordinates, what do 
constant coefficient vector fields on IR n look like? We have 



Recall moreover that functions in C°°(B n ) correspond exactly, under 
radial (un) compactification, to symbols of order zero on R™. So in fact 
it is easy to check more generally that vector fields on IR n with zero- 
symbol coefficients correspond exactly to vector fields on B n that, near 
S*™" 1 , take the form 



with a,bj G C°°(B n ). 

We generalize this notion as follows. Given our manifold X, let 
p G C°°(X) denote a boundary defining function, i.e. 



Let 0i be local coordinates on dX. We define scattering vector fields on 
X to be those that can be written locally, near dX, in the form 




d xi =pd oj -pY,0 k i do>>-P 2 j dp- 



a 



(p,e)p 2 d p + J2bM 9 )pd< 



p > on X, p _1 (0) = dX, dp ^0 on dX. 



a 



(p,0)p 2 d p + J2bj(P^)pdt 
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with a,bj G C°°(X). Let 

V SC (X) = {scattering vector fields on X} 

Exercise 10.7. 

(1) Show that Vsc(X) is well-defined, independent of the choices of 

P ,e. 

(2) Let Vb(X) denote the space of smooth vector fields on X tan- 
gent to dX. Show that 

V SC (X) = P V h (X) 

(3) Show that both V SC (X) and V\,{X) are Lie algebras. 

As we can locally describe the elements of V SC (X) as the C°°-span of 
n vector fields, V SC (X) is itself the space of sections of a vector bundle, 
denoted 

S c TX 

There is also of course a dual bundle, denoted 

SC T*X, 

whose sections are the C°°-span of the one-forms 

dp (MP_ 

9"' ■ 
P P 

Over X°, we may of course canonically identify SC T*X with T*X, and 
the canonical one-form on the latter pulls back to give a canonical 
one-form 

Ap d0 

10.6 i^+V- 

P 2 P 

defining coordinates £, 77 on the fibers of SC T*X. 

The scattering calculus on W l is concocted to contain scattering vec- 
tor fields: 

Exercise 10.8. Show that * s ^(R n ) D V sc {B n ). 

We can, following Melrose, define the scattering calculus more gen- 
erally as follows. Let SC T X denote the fiber- compactification of the 
bundle SC T*X, i.e. we are radially compactifying each fiber to a ball, 
just as we did globally in compactifying T*W n to B n x B n , only this 
time, the base is already compact. Now let 

S™' l { sc T*X) = a- m p- l C°°( sc T*X), 
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where a is a boundary defining function for the fibers. We can (by dint 
of some work!) quantize these "total" symbols to a space of operators, 
denoted 

(Note that in the case X = B n , we recover what we were previously 
writing as \I/^(R n ); the latter usage, with IR n instead of the more 
correct B n , was an abuse of the usual notation.) The principal symbol 
of a scattering operator is, in this invariant picture, a smooth function 
on d( sc T X); or equivalently, an equivalence class of smooth functions 
on or, in the partially uncompactified picture, an equivalence 

class of smooth symbols on SC T*X. (It is this last point of view that 
we shall mostly adopt.) In the coordinates defined by the canonical 
one-form (10.6), we have 

(10.7) a lfi (p 2 D p )=Z, a lfi ( P D ei )=ri j . 

Recall that the Euclidean metric may be written in polar coordinates 

as 

dOcr 1 ) 2 + (p _1 ) 2 MM0) 

with h denoting the standard metric on S' n_1 . We can generalize this 
to define a scattering metric as one on a manifold with boundary X 
that can be written in the form 

dp 2 h(p,9,d9) 

~P T + ? 

locally near dX, with p a boundary defining function, and h now a 
smooth family in p of metrics on dX. 54 

Exercise 10.9. 

(1) Show that if g is a scattering metric on X, then the Laplace 
operator with respect to g can be written 

A = (p 2 D p ) 2 + 0(p 3 )D p + p 2 A e 

where Ag is the family of Laplacians on dX associated to the 
family of metrics h(r, 6, d6). 

(2) Show that for A G C, 

^ 2 ,o(A - A 2 ) =e + \v\l- A 2 . 

(Note that this entails noticing that you can drop the 0(p 3 )D p 
terms for different reasons at the the two different boundary 
faces of SC T X. The term —A 2 is of course only relevant at the 

54 The usual definition, as in [18], is a little more general, allowing dp terms 
in h; however, it was shown by Joshi-Sa Barrcto that these terms can always be 
eliminated by appropriate choice of coordinates. 
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p = face; it does not contribute to the part of the symbol at 
fiber infinity, as it is a lower-order term there.) 

As a consequence of Exercise 10.9, note as before that for A £ R, the 
Helmholz operator A — A 2 is not elliptic in the scattering sense: there 
are points in sc Tq X X where £ 2 + \rj\ 2 h = A 2 . 

We now turn to scattering wavefront set WF SC , which can, as one 
might expect, be defined in the usual manner as a subset of 

d( sc T*X), 

hence is a subset of boundary faces at fiber infinity and at spatial infin- 
ity (i.e., over dX). The scattering wavefront set is the obstruction to a 
distribution lying in C°°(X), where C°°(X) denotes the set of smooth 
functions on X decaying to infinite order at dX. This space is the ana- 
logue of the space of Schwartz functions in our compactified picture: 

Exercise 10.10. Show that pullback under the radial compactification 
map sends C°°(B n ) to S(R n ). 

By (10.7), it is not hard to see that 

{p 2 D p - a )u = 0^ WF SC «c{p = 0,( = a}, 

(pD ej - p)u = =s> WF SC u d {p — 0, Vj = (3}. 

The following variant provides a useful family of examples (and can be 
proved with only a little more thought): if a(p, 9) and (p(p : 9) £ C°°(X), 
then 55 

WF SC (a(p,9)e i ^ p ' e)/p ) = {(p = Q,e,d{<f>{p,ff)lp) : (0,9) £ ess-suppa}, 

where ess-suppa C dX denotes the "essential support" of a, i.e. the 
points near which a is not 0(p°°). 
Of course, if 

(10.8) (A- \ 2 )u = f eC°°(X), 

then we have, by microlocal elliptic regularity, 

wf sc «c { P = o, e+\v\i = * 2 }- 

In fact, there is a propagation of singularities theorem for scattering 
operators of real principal type that further constrains the scattering 
wavefront set of a solution to (10.8): it must be invariant under the 
(appropriately rescaled) Hamilton vector field of the symbol of A — A 2 . 

55 The distribution ae 1 ^ used here is a simple example of a Legendrian distribu- 
tion. The class of Legendrian distributions on manifolds with boundary, introduced 
by Melrose-Zworski [19], stands in the same relationship to Lagrangian distributions 
as scattering wavefront set does to ordinary wavefront set. 
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Exercise* 10.11. Let u = d(£dp/p 2 + i] ■ d6 / p) and let 

show that up to an overall scaling factor, the Hamilton vector field of 
p with respect to the symplectic form oj is, on the face, p = just 

H p = 2^r j -d r ,-2\r j \ 2 ho d i + H ho 

where h = h\ p=0 , and H ho is the Hamilton vector field of h , i.e. (twice) 
geodesic flow on dX. 

Show that maximally extended bicharacteristics of H p project to the 
6 variables to be geodesies of length it. (Hint: reparametrize the flow.) 

(For a careful treatment of the material in this exercise and indeed 
in this section, see [18].) 

Appendix 

We give an extremely sketchy account of some background material 
on Fourier transforms, distribution theory, and Sobolev spaces. For 
further details, see, for instance, [25] or [11]. 

Let <S(M n ), the Schwartz space, denote the space 

{0 G C°°(M n ) : sup \x a d^(j)\ < oo Va,/3}, 

topologized by the seminorms given by the suprema. The dual space 
to <S(M n ), denoted <S'(IR n ), is the space of tempered distributions. 
For e S(W l ), let 

JT0(£) = (2rr)- n/2 J (f)(x)e-^ x dx. 

Then F<p e S(R n ), too; indeed, J= : S(W n ) ->■ S(W n ) is an isomorphism, 
and its inverse is closely related: 

T'^ix) = (27c)- n/2 J i){t)e + *- x dx. 

We can, by duality, then define J 7 on tempered distributions. 

Let E'iW 1 ) denote the space of compactly supported distributions on 
W 1 . When X is a compact manifold without boundary, we let T>'(X) 
denote the dual space of C°°(X). 

We define the (L 2 -based) Sobolev spaces by 

H s (R n ) = {ue S'{R n ) : (0 a ^«(0 e L 2 (R n )}, 

where (0 = (1 + |£| 2 ) 1/2 . If s is a positive integer, this definition co- 
incides exactly with the space of L 2 functions having s distributional 
derivatives also lying in L 2 . We note that the operation of multiplica- 
tion by a Schwartz function is a bounded map on each H s ; this is most 
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easily proved by interpolation arguments similar to (but easier than) 
those alluded to in Exercise 2.4 — cf. [25]. 

Throughout these notes we will take for granted the Schwartz kernel 
theorem, not so much result to be quoted but world-view. 
Recall that this result says any continuous linear operator 

S{W n ) ->■ S'(R n ) 

is of the form 

u H> J k(x,y)u(y) aly 

for a unique k G S'(M. n x R n ); a corresponding result also holds on 
all the manifolds that we will consider. We thus consistently take the 
liberty of confusing operators with their Schwartz kernels, although we 
let k(A) denote the Schwartz kernel of the operator A when we wish 
to emphasize the difference. 

Some results relating Schwartz kernels to traces are important for our 
discussion of the wave trace. Recall that an operator T on a separable 
Hilbert space is called Hilbert- Schmidt if 

^||T ej || 2 <oo 

j 

where {e-,} is any ortho normal basis. In the special case when our 
Hilbert space is L 2 (X) with X a manifold, the condition to be Hilbert- 
Schmidt turns out to be easy to verify in terms of the Schwartz kernel: 
T is Hilbert-Schmidt if and only if k{T), its Schwartz kernel, 56 lies in 
L 2 {X x X). 

A trace-class operator is one such that 

Y)(Tei, fj)\ < oo 

for every pair of orthormal bases {e^}, {fj}- It turns out to be the case 
that an operator T is trace-class if and only if it can be written 

T = PQ 

with P,Q Hilbert-Schmidt. The trace of a trace-class operator is given 
by 



It is probably best to think of X as a Riemannian manifold here, so that the 
Schwartz kernel is a function, which we can integrate against test functions via the 
metric density, and likewise integrate the kernel. 
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over an orthonormal basis: this turns out to be well-defined. We re- 
fer the reader to [20] for further discussion of trace-class and Hilbert- 
Schmidt operators. 
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